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Introduction 



M. Gromov pages 85-86: 

"3.15. Proposition: Let (V,g) be a Riemannian manifold with g continuous. 
For each v € V the spaces \(V,v) Lipschitz converge as A — > oo to the tangent space 
(T v V,0) with its Euclidean metric g v . 

Proof _|_: Start with a C 1 map (R n ,0) — > (V,i>) whose differential is isometric at 0. 
The A-scalings of this provide almost isometries between large balls in R n and those in 
XV for A — > oo. 

Remark: In fact we can define Riemannian manifolds as locally compact path 
metric spaces that satisfy the conclusion of Proposition 3.15. " 

If so, Gromov's remark should apply to any sub-Riemannian manifold. Why then 
is the sub-Riemannian case so different from the Riemannian one? Here is a list of 
legitimate questions: 

How can one define the manifold structure? Who are the tangent and cotangent 
bundles? What is the intrinsic differential calculus? Why are there abnormal geodesies 
if the Hamiltonian formalism on the cotangent bundle were complete? If the manifold 
is a compact Lie group does the tangent bundle carry a natural group structure? What 
are differential forms, de Rham cochain, and the variational complex? Consider the 
group of smooth volume preserving transformations. Why does this group have more 
invariants than the volume and what is the interpretation of these invariants? 

The purpose of this working seminar was to explore as many as possible open 
questions from the list above. Special attention has been payed to the case of a Lie 
group with a left invariant distribution. 

The seminar, organised by the author and Tudor Ra^iu at the Mathematics De- 
partment, EPFL, started in November 2001. 

The paper is by no means self contained. For any unproved result it is indicated 
the place where a complete proof can be found. The choice of the proofs is rather 
psychological: some of them have a geometrical or mixed geometric-analytical mean- 
ing (like the proof of Hopf-Rinow theorem), others help to understand that familiar 
reasoning applies in apparently unfamiliar situations (like in the preparations for the 
Pansu-Rademacher theorem). Important results (Ball-Box theorem for example) have 
elementary proof in particular situations; this might help to better understand their 
meaning and also their strength. 

These pages covers the expository talks given by the author during this seminar. 
However, this is the first part of three, in preparation, dedicated to this subject. It 
covers, with mild modifications, an elementary introduction to the field. 

The second part will deal with the applications of the Local-to-Global principle for 
moment maps arising in connection with Carnot groups. We prove, for example, that 
Kostant nonlinear convexity theorem 4.1 [18] can be seen as the sub-Riemannian version 



of the linear convexity theorem 8.2. op. cit. The key point is the redefinition of the 
tangent bundle of a Lie group endowed with a left invariant nonintegrable distribution, 
using a natural construction based on noncommutative derivative. 

The third part is devoted to the study of some representations of groups of bi- 
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Lipschitz maps on Carnot groups. The main idea is that one can formulate an inter- 
esting rectifiability theory as a theory of (irreducible) representations of such groups. 

Acknowledgements. I would like to express here my gratitude to Tudor Ratiu 
for his continuous support and for the subtle ways of sharing his wide mathematical 
views. I want also to thank to the participants of this seminar and generally for the 
warm athmosphere that one can encounter at the Mathematics Department of EPFL. 
I cannot close the aknowledgements without mentioning how much I benefit from the 
every-day impetus given by a force of nature near me, which is my wife Claudia. 
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1 From metric spaces to Carnot groups 

1.1 Metric spaces 

Definition 1.1 (distance) A function d : X x X — > [0, +oo) is a distance on X if: 

(a) d(x,y) = if and only if x = y. 

(b) for any x,y d(x,y) = d(y,x). 

( c) for any x, y, z d(x, z) < d(x, y) + d(y, z) . 

(X, d) is called a metric space. The open ball with centre x £ X and radius r > is 
denoted by B(x, r). 

If d ranges in [0, +oo] then it is called a pseudo-distance. The property of two 
points of being at finite distance is an equivalence relation, d is then a distance on each 
equivalence class (leaf). 

Definition 1.2 A map between metric spaces f : X — > Y is Lipschitz if there is a 
positive constant C such that for any x,y £ X we have 

d Y (f(x),f(y)) < Cd x (x,y) 

The least such constant is denoted by Lip(f) . 

The dilatation, or metric derivative, of a map f : X — > Y between metric spaces, in 
a point u € Y is 

dil(f)(u) =limsup sup { ^ Y ; v / w , v,w <G B(u,e) 

e^O { d X {v,w) 

The distorsion of a map f : X —>Y is 

dis f = sup {\d Y (f(y),f{y'))-d x (y,y')\ : y,y' e X) 

The name "metric derivative" is motivated by the fact that for any derivable function 
/ : R -» R n the dilatation is dil{f)(t) = \ f(t) |. 

A curve is a function / : [a, b] — > X. The image of a curve is called path. Length 
measures paths. Therefore length does not depends on the reparametrisation of the 
path and it is additive with respect to concatenation of paths. 

In a metric space (X, d) one can measure the length of curves in several ways. 

The length of a curve with L 1 dilatation / : [a, b] — > X is 

L(f)= f dil{f){t) dt 

J a 

A different way to define a length of a curve is to consider its variation. 
The curve / has bounded variation if the quantity 

(n-l 

Var(f) = supj J^d(/(ti),/(t i+ i)) : o < ti < ... < t n < b 
{ i=i 
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(called variation of /) is finite. 

The variation of a curve and the length of a path , as defined previously, do not 
agree in general. To see this consider the following easy example: / : [—1,1] — ► R 2 , 
f(t) = (t, sign(t)). We have Var (/) = 4 and L(/([-l, 1]) = 2. Another example: 
the Cantor staircase function is continuous, but not Lipschitz. It has variation equal 
to 1 and length of the graph equal to 2. 

Nevertheless, for Lipschitz functions, the two definitions agree. 

Theorem 1.3 For each Lipschitz curve f : [a, b] — > X, we have L(f) = Var(f). 
For the statement and the proof see the second part of Theorem 4.1.1., Ambrosio 



Proof. We prove the thesis by double inequality. / is continuous therefore f([a,b\) 
is a compact metric space. Let {x n : n G N} be a dense sequence in f([a,b\). All the 
functions 

1 1-> 4>„(t) = d(f(t),x n ) 
are Lipschitz and Lip((f> n ) < Lip(f), because of the general property: 

Lip{f o g) < Lip(f) Lip(g) 

if /, g are Lipschitz. In the same way we see that : 

dil(f){t) = sup{di/(0 n )(t) : neN} 

We have then, for t < s in [a, b]: 

d(f(t),f(s)) = sup{| d(f(t),x n )-d(f(s),x n ) | :n£N} < 

< f dil(f)(r) dr 



From the definition of the variation we get 

Var{f) < L(f) 

For the converse inequality let e > and n > 2 natural number such that h = (b 
a)/n < e. Set U = a + ih. Then 



\! £ d(f(t + h)J(t))dt < i^d(/( r + t i+1 ),/( T + ^))dr < 



- \ t Varu) dr = Var{f) 



Fatou lemma and definition of dilatation lead us to the inequality 

" £ dil(f)(t) dt < Var(f) 
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This finishes the proof because e is arbitrary. ■ 

There is a third, more basic way to introduce the length of a curve in a metric 
space. 

The length of the path A = /([a, b]) is by definition the one-dimensional Hausdorff 
measure of the path. The definition is the following: 



1(A) 



lim inf < diam E{ : diam Ei < 5 , A C Ei > 
Kiel iei J 



The general Hausdorff measure is defined further: 

Definition 1.4 Let k > and (X,d) be a metric space. The k- Hausdorff measure is 
defined by: 

H k {A) = lim inf \ ^ (diam Eif : diam E { < 5 , Ac\JeA 
^° I iei iei J 

Any k-Hausdorff measure is an outer measure. The Hausdorff dimension of a set A is 
defined by: 

H-dim A = inf jfc > : H k (A) = o} 

Variation of a curve and the Hausdorff measure of the associated path don't coincide. 
For example take a circle S 1 in R n parametrised such that any point is covered two 
times. Then the variation is two times the length. 

Theorem 1.5 Suppose that f : [a,b] — > X is an injective Lipschitz function and A = 
f([a,b]). Then 1(A) = Var{f). 

For the statement and the proof see Theorem 4.4.1., Ambrosio 0. 

In order to give the proof of the theorem we need two things. The first is the geomet- 
rically obvious, but not straightforward to prove in this generality, Reparametrisation 
Theorem (theorem 4.2.1 Ambrosio Q). 

Theorem 1.6 Any path A C X with a Lipschitz parametrisation admits a reparametri- 
sation f : [a, b] — * A such that dil(f)(t) = 1 for almost any t € [a, b]. 

The second result that we shall need is Lemma 4.4.1 Ambrosio Q. 

Lemma 1.7 If f : [a,b] — > X is continuous then 

HHfifab]) < d(f(a),f(b)) 



1 FROM METRIC SPACES TO CARNOT GROUPS 



8 



Proof. Let us consider the Lipschitz function 

c^-.X^R, </,(x) = d(x,f(a)) 

It has the property Lip((p) < 1 therefore by the definition of Hausdorff measure we 
have 

W^o/CMD) < H\f([a,b]) 

On the left hand side of the inequality we have the Hausdorff measure on R, which 
coincides with the usual (outer) Lebesgue measure. Moreover <j) o f([a, b\) = [0, a], 
therefore we obtain 

ft^o /([«,&])) = sup{d(/(t),/(o)):t€[o,6]} > d(f(a),f(b)) 



The proof of theorem 1.6 follows. 



Proof. It is not restrictive to suppose that A = f([a,b]) can be parametrised by 



/ : [a, b] — ► A such that dil(f)(t) = 1 for all t G [a, b]. Due to theorem L3 and again 
the reparametrisation theorem, we can choose [a, b] = [0,Var(f)]. 

For an arbitrary 5 > we choose n € N such that h = Var(f) jn < 5 and we divide 
the interval [0, Var(f)] in intervals Jj = [ih, (i + l)h]. The function / has Lipschitz 
constant equal to 1 therefore (see definition of Hausdorff measure and notations therein) 

n 

Hs(A) < J2 diam ( J i) = Var U) 

i=0 

5 is arbitrary therefore W 1 (A) < Var(f). This is a general inequality which does not 
use the injectivity hypothesis. 

We prove the converse inequality from injectivity hypothesis. Let us divide the 



interval [a,b] by a < to < ... < t n < b. From lemma 1.7 and sub-additivity of Hausdorff 
measure we have: 



n—1 n—1 



£ d(f(ti), f(t i+1 )) < ^HHfmU+i])) < H\A) 

i=0 i=0 

The partition of the interval was arbitrary, therefore Var(f) < 7i 1 (^4). ■ 

In conclusion, in any metric space we can measure length of Lipschitz curves fol- 
lowing one of the recipes from above, or we can measure length of paths with the 
one-dimensional Hausdorff measure. The lengths agree if the path admits a Lipschitz 
parametrisation. 

We shall denote by Id the length functional, defined only on Lipschitz curves, in- 
duced by the distance d. 

The length induces a new distance di, say on any Lipschitz connected component 
of the space (X,d). The distance di is given by: 

di(x,y) = inf {l d (f([a,b])) : f:[a,b]->X Lipschitz , 
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f(a) = x , f(b) = y} 

We have therefore two operators d i— > Id and I i— > d/. Is one the inverse of another? 
The answer is no. This leads to the introduction of path metric spaces. 

Definition 1.8 A path metric space is a metric space (X,d) such that d = d\. 

In terms of distances there is an easy criterion to decide if a metric space is path 
metric (Theorem 1.8., page 6-7, Gromov p3| ). 

Theorem 1.9 A complete metric space is path metric if and only if (a) or (b) from 
above is true: 

(a) for any x, y G X and for any e > there is z £ X such that 

max {d(x,z),d(z,y)} < -d(x,y)+£ 

(b) for any x,y £ X and for any ri,r 2 > 0, if n + r 2 < d(x, y) then 

d(B(x,n),B(y,r 2 )) < d(x,y)-r 1 -r 2 

Proof. We shall prove only that (a) implies that X is path metric, (b) implies (a) is 
straightforward, path metric space implies (b) likewise. 

Set S = d(x, y) and take a sequence > 0, with finite sum. We shall recursively 
define a function z = z £ from the dyadic numbers in [0, 1] to X. z e {l/2) = z±/ 2 1S a 
point such that 

max{d(x,z 1 / 2 ),d{z 1 / 2 ,y)} < -<5(l + ei) 

Suppose now that all points z £ (p/2 n ) = z p / 2 n were defined, for p = 1, 2 n — 1. Then 
z 2p+i/2™+ 1 is a point such that 

max{d(z p / 2 n,z 2p+1 / 2 n+i),d(z 2p+ i /2 n+i,z p+1 / 2n )} < -^Yli^+^k) 

k=l 

Because (X, d) is a complete metric space it follows that z £ can be prolonged to a 
Lipschitz curve c £ , defined on the whole interval [0, 1], such that c(0) = x, c(l) = y and 

d(x,y) < l(c £ < d(x,y) JJ(l+£ fe ) 

But the product Hfc>i(l + £ k) can be made arbitrarily close to 1, which proves the 
thesis. ■ 

Path metric spaces are geometrically more interesting, because one can define 
geodesies. Let c be any curve; denote the length of the restriction of c to the interval 
t,t' by l c {t,t'). 
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Definition 1.10 A (local) geodesic is a curve c : [a,b] —* X with the property that for 
any t G (a, b) there is a small e > such that c : [t — e, t + e] — > A zs length minimising. 
A global geodesic is a length minimising curve. 

Therefore in a path metric space a local geodesic has the property that in the 
neighbourhood of any of it's points the relation 

d(c(t),c(t')) = l c (t,t>) 

holds. Any global geodesic is also local geodesic. 
Can one join any two points with a geodesic? 



The abstract Hopf-Rinow theorem (Gromov 13], page 9) states that: 



Theorem 1.11 If (X,d) is a connected locally compact path metric space then each 
pair of points can be joined by a global geodesic. 



Proof. It is sufficient to give the proof for compact path metric spaces. Given the 
points x,y, there is a sequence of curves fh joining those points such that l(fh) < 
d(x, y) + l/h. The sequence, if parametrised by arclength, is equicontinuous; by Arzela- 
Ascoli theorem one can extract a subsequence (denoted also fh) which converges uni- 
formly to /. By construction the length function is lower semicontinuous hence: 

1(f) <liminfZ(/ h ) <d(x,y) 

h^oo 

Therefore / is a length minimising curve joining x and y. ■ 

The following remark of Gromov 1.13(b) m easily comes from the examination of 
the previous proof. 

Proposition 1.12 In a compact path metric space every free homotopy class is repre- 
sented by a length minimising curve. 



1.2 Local to Global Principle 

Let (X, d) be a connected locally compact and complete path metric space. 

The natural notion of subspace in this category of metric spaces is the following 
one. 



Definition 1.13 A set C C X is a closed convex set if the inclusion (C,dc) C (X, d) 
is an isometry, where dc is the inner path metric distance: 

dc(x,y) = inf {l d (f([a,b})) : f : [a,b] -> X Lipschitz , 

f(a)=x, f(b)=y, f([a,b])cC} 

We shall call a closed set weakly convex if any two points of the set can be joined 
by a local geodesic lying in the set. 
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A closed set it is therefore convex if any two points x, y G C can be joined by a 
global geodesic lying in C. Note that there might be other global geodesies joining two 
points in C which are not entirely in C. 

The purpose of the remainder of this section is to state and prove the Local-to- 
Global Principle. The principle has been formulated for the first time by Hilgert, Neeb, 



Plank [15]. It has been used to prove convexity theorems; it's most widely known 
application concerns an alternative proof of the Atiyah-Guillemin-Sternberg theorem 
about convexity of the image of the moment map. We shall prove in this section 
a general version of the principle, true in path metric spaces. Applications of this 
principle will be given further, where we will meet also the moment map. 

Definition 1.14 A pointed closed convex set is a pair (C,x) with iGCel and C 
closed convex set with the property that there is a constant K > 1 such that for any 
r > and any two points y,y' G CD B(x,r) can be joined by a global geodesic in 
CnB(x,Kr). 

Let X be a connected, compact topological Hausdorff space, Y a connected locally 
compact path metric space and / : X — > Y with the following properties: 

LCI. / is proper and for any x G X there is a neighbourhood V x of x such that 
f {f(y)) l~l Vx is connected for all y G V x . 

LC2. for any x G X there is a pointed closed convex (C x ,f(x)) and a neighbourhood 
V x of x such that / : V x — ► C x is open. 

Theorem 1.15 (Local-to- Global Principle). Under the hypotheses LCI, 
LC2, the image f(X) is closed weakly convex in Y. 



The proof is inspired by the one from Sleewaegen [30], section 1.8, Theorem 1.8.1 



Proof. The proof splits in two steps, one of topological nature and the other of metric 
character. 

Step 1. Let us consider on X the equivalence relation x « y if f(x) = f(y) 
and x,y are in the same connected component of / _1 (/(x)). Denote by X the set 
of equivalence classes, endowed with the quotient topology, and by it : X — > X the 
canonical projection. Let / : X — > N be the map with the property / o it = f. 

The property LC2 is hereditary: it is invariant to the choice of arbitrary small 
neighbourhoods V x . We obtain the following proposition (see Lemmas 1.8.1, 1.8.2. 
Sleewaegen ||30|| ) using topological reasoning and the hereditarity of LC2: 

Lemma 1.16 The space X is Hausdorff, connected and compact. The function f is 
proper, the preimage / _1 (y) has a finite number of elements and it satisfies the improved 
version of LC2: for any x G X there are a neighbourhood V x of x, a pointed closed 
convex set (C x , f(x)) and r x > such that the restriction f : V x — > C x n B(f(x), r x ) is 
a homeomorphism on the image. 
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Step 2. Denote by I the length functional in the compact set f{X) and by di the 
induced length distance. We begin by constructing a distance on X, which comes from 
the distance d\ and the function /. 

For any two points x, y S X let 

d(x,y) = inf |z(/°7) : 7 : [a,b] — > X continuous, 7(a) = x , 7(6) = y| 

We have the following immediate inequalities: 

d(x,y) > d,(/(x),/(y)) > d(f(x)J(y)) 

In order to check that c£; is a distance it suffices to show that if d(x, y) = then x = y. 
Indeed, we have /(a?) = f(y) and there are two possibilities. If y G (neighbourhood 
chosen as in the improved version of LC2) then y = x. Otherwise, if y g" V x then we 
have arbitrary short closed curves in Y which start and end at f(x) but which go out 
of C x fl B(f(x),r x ), which is in contradiction with r x > 0. 



Use now Gromov Theorem l.S to prove that (X, d) is a compact path metric space. 
Hopf-Rinow Theorem 1.11 ensures us that any two points x, y £ X can be joined by a 
global minimising geodesic. Moreover, such a geodesic projects on a global d\ geodesic 
in f(X), because of the construction of the distance d. 

Because f{X) = f(X) is compact, there is a K > 1 such that we can cover f(X) 
with a finite number of sets of the form C x knB(f(x k ),rf t ) and C x k P\B^{f{x k ), Kr^) C 
f(X). Then any d\ global geodesic is made by gluing a finite number of global geodesies 
in Y and the question is if such a curve is local geodesic in the neighbourhood of the 
gluing points. But the improved LC2 property ensures that. Indeed, take any two 
sufficiently closed points f(x) and f(y) on a d\ geodesic /07, with 7 global d geodesic. 
Then there is a neighbourhood V x of x such that y £ V x and / 07 is a Y global geodesic 
in f(X) which joins f(x) and f(y). I 

The compactness assumption on the space X is not important. For example if the 
constant K in the definition of closed convex pointed set can be chosen independent of 
x in LC2 then the compactness assumption is no longer needed. 

The classical Local-to-Global Principle is stated for vectorial target space Y, hence 
an arbitrary Euclidean space. Such spaces admit unique geodesic between any two 
different points, therefore the conclusion of the principle is that the image f{X) is 
convex. 

It seems hard to check that the function / satisfies LC2. In the case of the moment 
map this is a consequence of the existence of roots for some unitary representations. 
Note here the appearance of group theory. 



1.3 Distances between metric spaces 

The references for this section are Gromov |l3|] , chapter 3, and Burago & al. || section 
7.4. There are several definitions of distances between metric spaces. The very fertile 
idea of introducing such distances belongs to Gromov. 

In order to introduce the Hausdorff distance between metric spaces, recall the Haus- 
dorff distance between subsets of a metric space. 
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Definition 1.17 For any set A C X of a metric space and any e > set the e 

neighbourhood of A to be 

A £ = U x( z A B{x,e) 
The Hausdorff distance between A, B C X is defined as 

d%(A,B) = inf {e > : A C B £ , B C A e } 

By considering all isometric embeddings of two metric spaces X, Y into an arbitrary 
metric space Z we obtain the Hausdorff distance between X, Y (Gromov jy]] definition 
3.4). 

Definition 1.18 The Hausdorff distance dn(X,Y) between metric spaces X Y is the 
infimum of the numbers 

d z H (f(X),g(Y)) 

for all isometric embeddings f : X — > Z , g : Y — > Z in a metric space Z . 

If X, Y are compact then djj(X,Y) < +oo. Indeed, let Z be the disjoint union of 
X, Y and M = m&x{diam(X),diam(Y)}. Define the distance on Z to be 

(d x (x,y) x,y £ X 
d Y (x,y) x,yeY 
\M otherwise 

Then dfj{X,Y) < +oo. 

The Hausdorff distance between isometric spaces equals 0. The converse is also true 
(Gromov op. cit. proposition 3.6) in the class of compact metric spaces. 

Theorem 1.19 If X,Y are compact metric spaces such that dn(X,Y) = then X,Y 
are isometric. 

For the proof of the theorem we need the Lipschitz distance (op. cit. definition 3.1) 
and a criterion for convergence of metric spaces in the Hausdorff distance ( op. cit. 
proposition 3.5). We shall give the definition of Gromov for Lipschitz distance and the 
first part of the mentioned proposition. 

Definition 1.20 The Lipschitz distance di(X,Y) between bi-Lipschitz homeomorphic 
metric spaces X, Y is the infimum of 

I log du(f) i + 1 log duir 1 ) i 

for all f : X — > Y bi-Lipschitz homeomorphisms. 

Obviously, if di{X, Y) = then X, Y are isometric. Indeed, by definition we have 
a sequence f n : X — > Y such that dil(f n ) -> 1 as n -> oo. Extract an uniformly 
convergent subsequence; the limit is an isometry. 
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Definition 1.21 A e-net in the metric space X is a set P C X such that P e = X . 
The separation of the net P is 

sep(P) = inf {d(x,y) : x ^ y , x,y G P} 

A e-isometry between X and Y is a function f : X — > Y such that dis f < e and /(X) 
is a e net in Y . 

The following proposition gives a connection between convergence of metric spaces 
in the Hausdorff distance and convergence of e nets in the Lipschitz distance. 

Definition 1.22 A sequence of metric spaces (Xj) converyes in the sense of Gromov- 
Hausdorff to the metric space X if d#(X, Xj) — > as i — > oo. This means that there is 
a sequence r\i — > and isometric embeddinys fi : X — > Z{, gi : Xj — > Zj such that 

4*(/i(^)^ t (^)) < % 



Proposition 1.23 Xei (Xj) 6e a sequence of metric spaces converging to X and (r/j) 
as in definition 1.2$\ Then for any e > and any e-net P C X with strictly positive 



separation there is a sequence Pi C X such that 

1. Pi is a e + 2r]i net in Xj, 

di,{Pi,P) — > as « — > cx), uniformly with respect to P,Pi. 

We shall not use further this proposition, given for the sake of completeness. 
The next proposition is corollary 7.3.28 (b), Burago & al. ||. The proof is adapted 
from Gromov, proof of proposition 3.5 (b). 

Proposition 1.24 If there exists a e-isometry between X,Y then da{X,Y) < 2s. 

Proof. Let / : X — > Y be the e-isometry. On the disjoint union Z = X U Y extend 
the distances d , d Y in the following way. Define the distance between x 6 X and 
yeY by 

d(x,y) = mi{d x (x,u) + d Y (f(u),y)+e} 
This gives a distance d z on Z. Check that dfj(X, Y) < 2e. ■ 



The proof of theorem LIE follows as an application of previous propositions. 



1.4 Metric tangent cones 

The local geometry of a path metric space is described with the help of Gromov- 
Hausdorff convergence of pointed metric spaces. 
This is definition 3.14 Gromov [|13|. 



Definition 1.25 The sequence of pointed metric spaces (X n ,x n ,(i n ) converges in the 
sense of Gromov-Hausdorff to the pointed space (X, x, dx) if for any r > 0, e > there 
is no € N such that for all n > uq there exists f n : B n (x n , r) C X n — > X such that: 
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(1) fn(x n ) = X, 

(2) the distorsion of f n is bounded by e: dis f n < e, 

(3) B x (x,r) C (f n (B n (x n ,r))) £ . 

The Gromov-Hausdorff limit is defined up to isometry, in the class of compact 
metric spaces (Proposition 3.6 Gromov Jl3| ) or in the class of locally compact cones. 
Here it is the definition of a cone. 

Definition 1.26 A pointed metric space (X, xq) is called a cone if for any A > there 
is a map 5\ : X — > X such that 5\(xo) = xq and for any x,y € X 

d(Sx(x),S x (y)) = A d(x,y) 
Such a map is called a dilatation with center xq and coefficient A. 

The local geometry of a metric space X in the neighbourhood of xq € X is described 
by the tangent space to (X, xo) (if such object exists). 

Definition 1.27 The tangent space to (X, xq) is the Gromov-Hausdorff limit 

(T X0 ,0,d xo ) = lim (X, x , Xd) 

A^oo 

Three remarks are in order: 

1. The tangent space is obviously a cone. We shall see that in a large class of 
situations is also a group, hence a graded nilpotent group, called for short Carnot 
group. 

2. The tangent space comes with a metric inside. This space is path metric (Propo- 
sition 3.8 Gromov |l~3| ). 

3. The tangent cone is defined up to isometry, therefore there is no way to use the 
metric tangent cone definition to construct a tangent bundle. For a modification 
of the definitions in this direction see Margulis, Mostow [21|. 

One can ask for a classification of metric spaces which admit everywhere the same 
(up to isometry) tangent cone. Such classification results exist, for example in the 
category of Lipschitz N manifold. 

Definition 1.28 Let N be a cone with a path metric distance. A Lipschitz N manifold 
is an (equivalence class of, or a maximal) atlas over the cone N such that the change 
of charts is locally bi-Lipschitz. 



We shall see in section a rigidity (i.e. classification result) property of manifolds. 
Also, we shall prove in section 4.3 that a sub-Riemannian manifold (see definition 1.32) 
which is also a Lie group does not admit a manifold structure over the nilpotentisation 
of the defining distribution, such that the operation and the group exponential to be 
smooth. We will be forced therefore to modify the definition of a manifold in order to 
show that a group with a left invariant distribution has a manifold structure over the 
nilpotentisation. The modification will lead, surprising but natural, to (a generalisation 
of) the moment map. 
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1.5 Examples of path metric spaces 

In this section are given examples of path metric spaces. These are: 

- Riemannian manifolds 

- Finsler manifolds 

- Carnot-Caratheodory manifolds 

Let X be a Riemannian manifold and 1(f) be the length of the curve / with respect 
to the metric on X, if / is piecewise C 1 , otherwise 1(f) = +oo. This is, off course, a 
metric structure on the Riemannian X. The class of curves with potential finite length 
can be enlarged or restricted by using analytical arguments. 

Change the metric on X by a Finsler metric, i.e. consider a continuous mapping 
A : TX -> R such that for any x <G X, A € R and v <G T X X 

A x (\v) =| A | A x (v) 

Then for any piecewise C 1 curve c : [a, b] — > X define it's length by 

1(c) = f A c(t) (c(t)) dt 

J a 

This give rise to a (non-oriented) length structure on X. 

Instead of a Finsler metric A consider an oriented version obtained by imposing 
a milder condition upon A: to be positively one-homogeneous. That means: for any 
x G X, A > and v G T X X 

A x (\v) = XA x (v) 

The length structure associated to A is now oriented. 

A particular case is the one of a Finsler-Minkowski metric. Suppose first that (X, g) 
is Riemannian (g is the metric). Take Q x C T X X a convex bounded set and denote by 
Xx the characteristic function 

Axy ' \ +oo otherwise 

Suppose that Q x contains a ball centered in the origin. Define now the Finsler- 
Minkowski metric 

A x (v) = sup {g(v,u) - Xx(u) : u E T X X} 

that is A x is the polar of Xx- From the hypothesis, there are positive constants c, d 
such that (| v \ x = \/g(v, v)) 

c | v \ x < A x (v) < d | v \ x 

The length structure associated to A is in general oriented. It becomes non oriented if 
the set Q x is symmetric with respect to the origin. 

In almost all examples we have a Riemannian manifold and a length structure 
constructed from a map A. We shall suppose now that the manifold is complete with 
respect to the Riemannian distance. 
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Proposition 1.29 The Hopf-Rinow theorem holds on a Finsler- Minkowski manifold, 
provided that it is complete with respect to the initial Riemannian distance and that A 
is convex. 

A more particular case is the following: 

Proposition 1.30 Let G be a Lie group and U be a left invariant Finsler- Minkowski 
metric on G. Then (G,djj) is complete and locally compact. 

Proof. The exponential map exp is a diffeomorphism from V(0) 3 U C g to exp U. 
From the hypothesis we get that exp^ is djj Lipschitz. The exponential is also open. 
G is locally a linear group, hence from the hypothesis there is a constant c such that 

du(e, exp v) > c\\v\\ 

Therefore the metric topology given by djj is the same as the manifold topology, hence 
locally compact. 

Take now a Cauchy sequence (xh). For a sufficiently small e > there is iV = N(e) 
such that x~^Xh lies in a compact neighbourhood of the identity element e for any 
h > N. One can extract a convergent subsequence from it and the proof finishes. ■ 

As a corollary of propositions 1.29 and 1.30| we have the following concrete Hopf- 
Rinow theorem: 

Corollary 1.31 On a Lie group endowed with a convex left (or right) invariant Finsler- 
Minkowski metric any two points can be joined by a geodesic. 

Another class of examples is provided by sub-Riemannian manifolds. 

Definition 1.32 A sub-Riemannian (SR) manifold is a triple (M,H,g), where M is a 
connected manifold, H is a subbundle ofTM, named horizontal bundle or distribution, 
and g is a metric (inner-product) on the horizontal bundle. 

A horizontal curve is a continuous, almost everywhere differ entiable curve, whose 
tangent lies in the horizontal bundle. 

The length of a horizontal curve c : [a,b] — > M is 

1(c) = [ g(c(t),c(t)) dt 



The SR manifold is called a Carnot-Caratheodory (CC) space if any two points can 
be joined by a finite length horizontal curve. 

A CC space is a path metric space with the Carnot-Caratheodory distance induced 
by the length I: 

d(x,y) = inf {1(c) : c : [a,b] — > M , c(a) = x , c(b) = y} 

The particular case that will be interesting for us is the following one. Consider a 
connected real Lie group and a vector space V C g, which generates the whole algebra 
g by Lie brackets. Then we shall see that the left translations of V provide a non- 
integrable distribution V on G (easy form of Chow theorem). For any (left or right 
invariant) metric defined on D we have an associated Carnot-Caratheodory distance. 
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2 Carnot groups 

2.1 Structure of Carnot groups 

Definition 2.1 A Carnot (or stratified nilpotent) group is a connected simply con- 
nected group N with a distinguished vectorspace V\ such that the Lie algebra of the 
group has the direct sum decomposition: 

m 
i=l 

The number m is the step of the group. The number 

m 

Q = i dimVi 
i=i 

is called the homogeneous dimension of the group. 

Because the group is nilpotent and simply connected, the exponential mapping is a 
diffeomorphism. We shall identify the group with the algebra, if is not locally otherwise 
stated. 

The structure that we obtain is a set TV endowed with a Lie bracket and a group 
multiplication operation. 

The Baker-Campbell-Hausdorff formula shows that the group operation is polyno- 
mial. It is easy to see that the Lebesgue measure on the algebra is (by identification) 
a bi-invariant measure. 

We give further examples of such groups: 

(1.) R n with addition is the only commutative Carnot group. 
(2.) The Heisenberg group is the first non-trivial example. This is the group 
H(n) = R 2n x R with the operation: 

(x,x)(y,y) = (x + y,x + y+^u(x,y)) 

where u is the standard symplectic form on R? n . The Lie bracket is 

[{x,x),{y,y)\ = (0,uj(x,y)) 

The direct sum decomposition of (the algebra of the) group is: 

H{n) = V + Z , V = R 2n x {0} , Z = {0} x R 

Z is the center of the algebra, the group has step 2 and homogeneous dimension 2n + 2. 

(3.) H-type groups. These are two step nilpotent Lie groups N endowed with an 
inner product (•,■), such that the following orthogonal direct sum decomposition occurs: 

N = V + Z 
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Z is the center of the Lie algebra. Define now the function 

J:Z^End{V), {J z x,x') = (z,[x,x'}) 
The group N is of H-type if for any z € Z we have 

J z o J z = — | z | 2 I 
From the Baker-Campbell-Hausdorff formula we see that the group operation is 

(x,z)(x',z) = (x + x',z + z' + -[x,x']) 

These groups appear naturally as the nilpotent part in the Iwasawa decomposition of 
a semisimple real group of rank one. (see ||) 

(4.) The last example is the group of n x n upper triangular matrices, which 
is nilpotent of step n — 1. This example is important because any Carnot group is 
isomorphic with a subgroup of a group of upper triangular matrices. 

Any Carnot group admits a one-parameter family of dilatations. For any e > 0, the 
associated dilatation is: 

m m 

x = ' S ^ j Xi h-> 5 £ x = e % Xj 

i=l i=l 

Any such dilatation is a group morphism and a Lie algebra morphism. 

In fact the class of Carnot groups is characterised by the existence of dilatations. 

Proposition 2.2 Suppose that the Lie algebra q admits an one parameter group e € 
(0, +oo) i — > 5 £ of simultaneously diagonalisable Lie algebra isomorphisms. Then Q is 
the algebra of a Carnot group. 

Proof. The hypothesis means that there is a direct sum decomposition of 
such that for any e > we have 

m m 

x = i-> 5 E x = y^fj(e)xj 

i=l i=l 

with fi continuous, moreover 

8 e o 8^ = 5 £fl 

for any e,/i > and 8\ = id. From this we get fi(s) = e ai . Each 8 £ is also Lie 
algebra morphism, therefore (xi € Vi, Xj € Vj) 

8s[Xi, Xj ] — [^Xj, — £ -i[Xi } Xj~\ 

We conclude that Oj + aj is also an eigenvalue, unless [xi,Xj] = 0. The direct sum 
decomposition of g is finite therefore Qj = ia and, more important, [Vi, Vj] = V^+j. 
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In conclusion g is the Lie algebra of a Carnot group and 5 e are the dilatations, up to a 
scale factor a. ■ 

We can always find inner products on N such that the decomposition N = Y17=i ^ 
is an orthogonal sum. 

We shall endow the group N with a structure of a sub-Riemannian manifold now. 
For this take the distribution obtained from left translates of the space V\ . The metric 
on that distribution is obtained by left translation of the inner product restricted to 
Vl 

If V\ Lie generates (the algebra) N then any element x € N can be written as a 
product of elements from V±. An useful lemma is: 

Lemma 2.3 Let N be a Carnot group and Xi,...,X p an orthonormal basis for V\. 
Then there is a a natural number M and a function g : {1,...,M} — > {l,...,p} such 
that any element x £ N can be written as: 

M 

x = Y[exp(tiX gii) ) (2.1.1) 

i=i 

Moreover, if x is sufficiently close (in Euclidean norm) to then each ti can be chosen 
such that | U |< C||a;|| 1/m 

Proof. This is a slight reformulation of Lemma 1.40, Folland, Stein ||]. We have used 
Proposition |2.8| applied for the homogeneous norm | • |i (see below). ■ 

This means that there is a horizontal curve joining any two points. Hence the 
distance 

d(x,y) = inf I J ||c _1 c|| dt : c(a) = x, c(b) = y, c _1 c 

is finite for any two x, y G N. The distance is obviously left invariant. 
A Carnot group iV can be therefore described in the following way. 

Proposition 2.4 (JV, d) is a path metric cone and also a group such that left transla- 
tions are isometrics. Also, dilatations are group isomorphisms. 

Associate to the CC distance d the function: 

| x | d = d(0,x) 

This function has the following properties: 

(a) the set {x € G : | x \d= 1} does not contain 0. 

(b) | \d=\ x \d for any ifG. 

(c) | 5 £ x \d= £ | x \d for any x € G and e > 0. 

(d) for any x, y £ N we have 

I xy \ d < | x \ d + | y \d 

Proposition 2.5 Balls with respect to the distance d are open and there is a basis for 
the topology on N (as a manifold) made by d balls. 
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Proof. We have to prove that d induces the same topology on N as the manifold 
topology, which is the standard topology induced by Euclidean distance on N. 

Set d,R to be the Riemannian distance induced by a metric which extends the metric 
on D and makes the direct sum N = ®iVi orthogonal. We shall have then the 
inequality 

dn{x,y) < d(x,y) 

for all x,y E N. 

The function | • |^ is also continuous. Indeed, from Lemma 2^3 we see that 

M 



x\d< ^2\u 



8=1 

with ti — > when — > 0. 

These two facts prove the thesis. ■ 

The map | • \d looks like a norm. However it is intrinsically defined and hard to 
work with. This is the reason for the introduction of homogeneous norms. 

Definition 2.6 A continuous function from x i— >| x \ from G to [0,+oo) is a homoge- 
neous norm if 

(a) the set {x E G : \ x \= 1} does not contain 0. 

(b) | x^ 1 |=| a; | for any x G G. 

(c) \ 5 E x |= e | x | for any x € G and e > 0. 



Homogeneous norms exist. An example is: 



x li = 2 



r- I 1 / 1 



1 = 1 



Proposition 2.7 ^4ny iu?o homogeneous norms are equivalent. Let \ ■ \ be a homoge- 
neous norm. Then the set {x : | x |= 1} is compact. 

There is a constant C > such that for any x,y E N we have: 

| xy | < C (| a; | + | y |) 



Proof. We show first that there are constants c, C > such that for any x G iV we 
have 

c | x |i < | x | < C | x | 

For this consider the compact set B = {x : | x |i= 1}. The norm | • |i has a minimum 
c and a maximum C on the set £> (0 < c < C). Let now x E N, x ^ 0. Then x = 5 £ y 
with e = | x \\. The homogeneity of the norms show that | y \\= 1 and the proof is 
done. 
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A consequence of the equivalence of the norms is that for any norm | • | the set 
{x : | x |= 1} is compact. 

For the third assertion remark that the set in N 2 of all (x, y) such that | x | + | y \ 
< 1 is compact. The function (x,y) i— >| x + y | attains a maximum C on this set. Use 
again the homogeneity of the norm to finish the proof. ■ 

Let us denote by || • || the Euclidean norm on N. The following estimate is easily 
obtained using homogeneity, as in the previous proposition. 

Proposition 2.8 Let | • | be a homogeneous norm. Then there are constants c,C > 
such that for any i£JV, | x |< 1, we have 

c\\x\\ < I x I < C\\x\\ 1/m 



The balls in CC-distance look roughly like boxes (as it is the case with the Euclidean 
balls). A box is a set 



Box{r) = I x = : \\xi\\ <r l 



i=i 



Proposition 2.9 ("Ball-Box theorem") There are positive constants c,C such that for 
any r > we have: 

Box(cr) C B(0, r) C Box{Cr) 



Proof. The box Box(r) is the ball with radius r with respect to the homogeneous 
norm 

[ x |oo = max jllxjll 1 /''! 



By Proposition 2.7 the norm | • \ CjQ is equivalent with | ■ which proves the thesis. 



A consequence of the Ball-Box Theorem is 

Theorem 2.10 (Hopf-Rinow theorem for Carnot groups) Any two points in a Carnot 
group can be joined by a geodesic. 

Proof. Take two arbitrary points in N. Because the distance is left invariant we can 
choose x = 0. Because dilatations change the distances by a constant factor, we can 
suppose that y G -B(0, 1). 

A previous proposition shows that the topology generated by a homogeneous norm 
and the topology generated by the Euclidean norm are the same. Therefore the space 
is metrically complete. The Ball-Box Theorem implies that the ball B(0, 1) is compact. 



We are in the assumptions of the abstract Hopf-Rinow theorem 1.11 
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An easy computation shows that 

C(B(0,5 £ r)) = e Q £(B(0,r)) 

As a consequence the Lebesgue measure is absolutely continuous with respect to the 
Hausdorff measure H . Because of the invariance with respect to the group operation 
it follows that the Lebesgue measure is a multiple of the mentioned Hausdorff measure. 

The following theorem is Theorem 2. from Mitchell p3| ], in the particular case of 
Car not groups. 

Theorem 2.11 The ball B(0, 1) has Hausdorff dimension Q. 

Proof. We know that the volume of a ball with radius e is cs® . 

Consider a maximal filling of B(0, 1) with balls of radius e. There are N e such balls 
in the filling; an upper bound for this number is: 

A £ < l/e Q 

The set of concentric balls of radius 2e cover B(0, 1); each of these balls has diameter 
smaller than 4e, so the Hausdorff a measure of -B(0, 1) is smaller than 

lim N £ {2e) a 

which is if a > Q. Therefore the Hausdorff dimension is smaller than Q. 

Conversely, given any covering of B(0, 1) by sets of diameter < e, there is an as- 
sociated covering with balls of the same diameter; the number M £ of this balls has a 
lower bound: 

M £ > l/e Q 

thus there is a lower bound 

e a > e a /e Q 

cover 

which shows that if a < Q then 7i a (B(0, 1)) = oo. Therefore the Hausdorff dimension 
of the ball is greater than Q. ■ 

We collect the important facts discovered until now: let A be a Carnot group 
endowed with the left invariant distribution generated by V\ and with an Euclidean 
norm on V\. 

(a) If V\ Lie-generates the whole Lie algebra of A then any two points can be joined 
by a horizontal path. 

(b) The metric topology of A is the same as Euclidean topology. 

(c) The ball B(0, r) looks roughly like the box {x = Y^iLi x i '■ \\ x i\\ — r *}- 

(d) the Hausdorff measure is group invariant and the Hausdorff dimension of a 
ball is Q. 

(e) the Hopf-Rinow Theorem applies. 

(f ) there is a one-parameter group of dilatations, where a dilatation is an isomorphism 
5 £ of A which transforms the distance d in ed. 
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2.2 Pansu differentiability 



A Carnot group has it's own concept of differentiability, introduced by Pansu [25]. 

In Euclidean spaces, given / : R n — * R m and a fixed point x £ R n , one considers 
the difference function: 



rn 



The convergence of the difference function as t — > in the uniform convergence gives 
rise to the concept of differentiability in it's classical sense. The same convergence, but 
in measure, leads to approximate differentiability. Other topologies might be considered 
(see Vodop'yanov |33|]). 

In the frame of Carnot groups the difference function can be written using only 
dilatations and the group operation. Indeed, for any function between Carnot groups 
/ : G — ► P, for any fixed point and e > the finite difference function is defined 

by the formula: 

X € B(l) cGh 5- 1 {/(x)- 1 / (x5 £ X)) G P 

In the expression of the finite difference function enters 5" 1 and S e , which are dilatations 
in P, respectively G. 

Pansu's differentiability is obtained from uniform convergence of the difference func- 
tion when e — > 0. 

The derivative of a function / : G — ► P is linear in the sense explained further. For 
simplicity we shall consider only the case G = P. In this way we don't have to use a 
heavy notation for the dilatations. 

Definition 2.12 Let N be a Carnot group. The function F : N — > N is linear if 

(a) F is a group morphism, 

(b) for any e > F o 5 e = 5 e o F. 

We shall denote by HL(N) the group of invertible linear maps of N , called the linear 
group of N . 

The condition (b) means that F, seen as an algebra morphism, preserves the grading 
of N. 

The definition of Pansu differentiability follows: 

Definition 2.13 Let f : N — > N and x € N. We say that f is (Pansu) differentiable 
in the point x if there is a linear function Df(x) : N — > N such that 

sup {d(F £ (y),Df(x)y) : y € B(0,1)} 

converges to when e — > 0. The functions F e are the finite difference functions, defined 
by 

F t {y) = 5t l {f{x)- l f(x8 t y)) 
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The definition says that / is differentiable at x if the sequence of finite differences 
Ft uniformly converges to a linear map when t tends to 0. 

We are interested to see how this differential looks like. For any / : N — » N, 
x,y G N Df(x)y means 

Df(x)y = lim S' 1 (fix)- 1 f(xS t y)) 
provided that the limit exists. 

Proposition 2.14 Let f : N ^ N , y, z £ N such that: 
(i) Df{x)y, Df(x)z exist for any x E N. 
(ii) The map x i— > Df{x)z is continuous. 

(Hi) x i — > (/(x) _1 /(x<5tz)) converges uniformly to Df(x)z. 
Then for any a, b > and any w = daxS^y the limit Df(x)w exists and we have: 

Df(x)S a x5 b y = 5 a Df(x)y 5 b Df(x)z 



Proof. The proof is standard. Remark that if Df(x)y exists then for any a > the 
limit Df{x)8 a y exists and 

Df(x)5 a y = 5 a Df(x)y 

It is not restrictive therefore to suppose that w = yz, such that | y \d = 1. 
We write: 

5^ (f{x)- l f{x5 t w)) = (1)(2)(3) 

where 

(1) = 5^ {f{x)- l f{x5 t y)) 
(2) = 5i l (f(xS ty y l f(xS t y5 t z)) (D f (x5 t y) z)' 1 
(3) = Df(x5 t y)z 

When t tends to (i) implies that (1) tends to Df(x)y, (ii) implies that (3) tends to 
Df(x)z and (iii) implies that (2) goes to 0. The proof is finished. ■ 

If / is Lipschitz then the previous proposition holds almost everywhere. In Propo- 
sition 3.2, Pansu |23] there is a more general statement: 

Proposition 2.15 Let f : N — > N have finite dilatation and suppose that for almost 
any x 6 N the limits Df(x)y and Df(x)z exist. Then for almost any x and for any 
w = °~a,y b~bZ the limit Df{x)w exists and we have: 

Df(x)5 a x5 b y = 5 a Df(x)y 5 b Df(x)z 
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Proof. The idea is to use Proposition 2.14. We shall suppose first that / is Lipschitz. 
Then any finite difference function Ft is also Lipschitz. 
Recall the general Egorov theorem: 

Lemma 2.16 (Egorov Theorem) Let [i be a finite measure on X and (f n )n a sequence 
of measurable functions which converges fx almost everywhere (a.e.) to f . Then for 
any e > there exists a measurable set X e such that fi(X \ X £ ) < e and f n converges 
uniformly to f on X £ . 

Proof. It is not restrictive to suppose that f n converges pointwise to /. Define, for 
any g,peN, the set: 

X q>p = {x G X : | f n (x) - f(x) | < 1/p , Vn > q} 

For fixed p the sequence of sets X qtP is increasing and at the limit it fills the space: 

(J x q>p = X 

Therefore for any e > and any p there is a q(p) such that n(X \ X q ^ p ) < e/2 p . 
Define then 

X £ = f] X q(p)p 

P&N 

and check that it satisfies the conclusion. ■ 

We can improve the conclusion of Egorov Theorem by claiming that if \i is a Borel 
measure then each X e can be chosen to be open. 

In our situation we can take X to be a ball in the group N and fi to be the Q 



Hausdorff measure, which is Borel. Then we are able to apply Proposition 2.14 on X £ . 
When we tend e to we obtain the claim (for / Lipschitz). ■ 



A consequence of this proposition is Corollaire 3.3, Pansu [25]. 



Corollary 2.17 If f : N — > N has finite dilatation and X\,...,Xr; is a basis for the 
distribution V% such that 

s h-> f(x5 s Xi) 

is differentiable in s = for almost any x € N , then f is differentiate almost every- 
where and the differential is linear, i.e. if 

M 

y = H 6 uX g(l) 
i=i 

then 

M 



Df{x)y = Y[5 u X g{l) {f)(x) 



i=l 
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With the definition of (Pansu) derivative at hand, it is natural to introduce the 
class of C 1 maps from N to M. 

Definition 2.18 Let N,M be Carnot groups. A function f : N — > M is of class 
C 1 if it is Pansu derivable everywhere and the derivative is continuous as a function 
Df : N x N — > M x M . Here N x N and M x M are endowed with the product topology. 

Remark 2.19 For example left translations L x : N — > N , L x (y) = xy are C 1 but 
right translations R x : N — > N, L x (y) = yx are not even derivable. 

We can introduce the class of C 1 N manifolds. 

Definition 2.20 Let N be a Carnot group. A C 1 N manifold is an (equivalence class 
of, or a maximal) atlas over the group N such that the change of charts is locally 
bi-Lipschitz. 



2.3 Rademacher theorem 

A very important result is theorem 2, Pansu [p5f| , which contains the Rademacher 
theorem for Carnot groups. 

Theorem 2.21 Let f : M — > N be a Lipschitz function between Carnot groups. Then 
f is differ entiable almost everywhere. 



The proof is based on the corollary 2.17 and the technique of development of a 
curve, Pansu sections 4.3 - 4.6 

Let c : [0, 1] — > N be a Lipschitz curve such that c(0) = 0. To any division 
E : = to < .... < t n = 1 of the interval [0, 1] is a associated the element as € N (the 
algebra) given by: 

n 

o"e = y^ j c(t k y 1 c(t k+1 ) 

k=0 



Lemma (18) Pansu [26] implies the existence of a constant C > such that 



2 



Iks - c(l)|| < C £>(**)~ lc (**+i)ll (2.3.2) 
\fc=o / 

Take now a finer division £' and look at the interval [tfc , ifc+i] divided further by S' 
like this: 

tk = t\ < ... < t' m = t k+ i 
The estimate ( p. 3. 2 ) applied for each interval [^,4+1] lead us to the inequality: 

■n 

Iks' - CFT,\\ < ^2d(c(t k ),c(t k+ i)f 
k=0 
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The curve has finite length (being Lipschitz) therefore the right hand side of the previous 
inequality tends to with the norm of the division. Set 

a(s) = lim o"s(s) 

|| S || — >0 

where <ts(s) is relative to the curve c restricted to the interval [0, s]. The curve such 
defined is called the development of the curve c. It is easy to see that a has the same 
length (measured with the Euclidean norm || ■ ||) as c. If we parametrise c with the 
length then we have the estimate: 

Ms) c{s)\\ < Cs 2 (2.3.3) 

This shows that a is a Lipschitz curve (with respect to the Euclidean distance). Indeed, 
prove first that a has finite dilatation in almost any point, using ( 2.3.3| ) and the fact 



that c is Lipschitz. Then show that the dilatation is majorised by the Lipschitz constant 
of c. By the classical Rademacher theorem a is almost everywhere derivable. 

Remark 2.22 The development of a curve can be done in an arbitrary Lie connected 
group, endowed with a left invariant distribution which generates the algebra. One 
should add some logarithms, because in the case of Carnot groups, we have identified 
the group with the algebra. The inequality ( [2.3.3 ) still holds. 



Conversely, given a curve a in the algebra N, we can perform the inverse operation 
to development (called multiplicative integral by Pansu; we shall call it "lift"). Indeed, 
to any division X of the interval [0, s] we associate the point 

n 

CT,{s) = \\ - °~{tk)) 

k=0 

Define then 

c(s) = lim cs(s) 
||£||-*0 

Remark that if o"([0, 1]) C V\ and it is almost everywhere differentiable then c is a 
horizontal curve. 

Fix s G [0, 1) and parametrise c by the length. Apply the inequality ( 2.3.3] ) to the 



curve t I—*- c(s)~ 1 c(s + t). We get the fact that the vertical part of a(s + t) — a(s) is 
controlled by s 2 . Therefore, if c is Lipschitz then a is included in V\. 

Denote the i multiple bracket [x, [x, ...[x,y]...] by [x,y]j. For any division E of the 
interval [0, s] set: 

n 

4(«) = X^-O^fc+i)]* 

fc=0 

As before, one can show that when the norm of the division tends to 0, A\, converges. 
Denote by 



A\s) = !\o-,do-]i = lim AUs) 

JO ll E IH0 



2 CARNOT GROUPS 



29 



the i-area function. The estimate corresponding to ( 2.3. 3j ) is 

||A*(a)|| < Cs i+l (2.3.4) 

What is the signification of A l (s)7 The answer is simple, based on the well known 
formula of derivation of left translations in a group. Define, in a neighbourhood of in 
the Lie algebra q of the Lie group G, the operation 

X s Y = log G (exp G (X)exp G (Y)) 

The left translation by X is the function Lx(Y) = X ■ Y . It is known that 

dl x (o)(z) = Y,j-TTv [x ^ z]l 

It follows that if the group G is Carnot then [X, Z\ measures the infinitesimal variation 
of the Vi component of Lx(Y), for Y = 0. Otherwise said, 



d 1 _• f t+e 

dt (« + l)!e->0 j t 

Because a is Lipschitz, the left hand side exists for all i for a.e. t. The estimate fl2.3.4|) 
tells us that the right hand side equals 0. This implies the following proposition (Pansu, 
4.1 



Proposition 2.23 If c is Lipschitz then c is differentiable almost everywhere. 
Proof of theorem 2.21. The proposition 2.23| implies that we are in the hypothesis 



of corollary 2.17. 



Pansu-Rademacher theorem has been improved for Lipschitz functions / : A C 
M — > N, where M, N are Carnot groups and A is just measurable, in Vodop'yanov, 
Ukhlov [34]. Their technique differs from Pansu. It resembles with the one used in 



Margulis, Mostow [20], where it is proven that any quasi-conformal map from a Carnot- 
Caratheodory manifold to another is a.e. differentiable, without using Rademacher the- 
orem. Same result for quasi-conformal maps on Carnot groups has been first proved by 
Koranyi, Reimann |i~7]| . Finally Magnani |l9| reproved a.e. differentiability of Lipschitz 
functions on Carnot groups, defined on measurable sets, continuing Pansu technique. 

For a review of other connected results, such as approximate differentiability, dif- 
ferentiability in Sobolev topology, see the excellent Vodop'yanov |33j] . 

2.4 Area formulas 

The subject of this subsection is the change of variable formula. In order to prove it 
one needs more involved (though basic) knowledge on metric measure spaces. We shall 
just sketch the proofs, delaying for the second part of these notes the rigorous proofs. 
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Definition 2.24 A metric measure (or mm) space is a metric space (X,d) endowed 
with a Borel measure \i satisfying the doubling condition, i. e. there is a positive constant 
C such that 

n(B{x,2r)) < C/j{B(x,r)) 

for any x € X , r > 0. 

A Carnot group is a metric measure space, endowed with the invariant volume 
measure (which is the Lebesgue measure). Indeed, this is a consequence of the existence 
of dilatations. 

In any mm space the Vitali covering theorem holds: 

Theorem 2.25 (Vitali) Let (X,d,fj,) be a mm space, Ad X and T a family of closed 
sets in X which covers A. If for any x € A and e > there exists V £ J 7 such that 
x € V and diam V < e then one can extract from T a countable disjoint family (Vi)jgjv 
such that 

fi{A\U ieN Vi) = 

Vitali covering theorem is the only technical ingredient needed to prove the following 
result (compare with Margulis, Mostow |2C], lemma 2.3). But before this we need a 
definition. 

Definition 2.26 Let (X,d,fJ,) be a mm space and a a measure on X. The lower 
spherical density of the measure a at the point x £ X is 

= liminf4^4 

dfi e^o u(B(x,e)) 

If (Y, a) is measure space and f : X — > Y is measurable then the pull-back of a by 
f is the measure f*(o~) on X defined by: 

f{a)(A) = a(f(A)) 

The Jacobian of f in x € X is the spherical density of f*{o~) in x: 

d/jL £-»o u{B(x,e)) 

Proposition 2.27 Let (X, d, fi) be a mm space, (Y, a) a finite measure space and f : 
X — > Y be an injective measurable map. We have then the inequality: 

I J fix) d/j, < a(f(X)) (2.4.5) 
Jx 

In order to establish the change of variables (or area formula) for / bi-Lipschitz 
between Carnot groups one has to prove first that in ( [2.4.5 ) we have equality. This 



is done by showing that n{Z) = 0, where Z is the set of zeroes of J f . Further, one 



has to use the Rademacher theorem 2.21 in order to show that the Jacobian equals the 
(modulus of the) determinant of the derivative, almost everywhere. Indeed, the idea is 
to use the uniform convergence in the definition of Pansu derivative. We arrive to the 
change of variables theorem. 
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Theorem 2.28 Let f : A d M ^ N be a bi-Lipschitz function from a measurable 
subset A of a Carnot group M to another Carnot group N . We have then: 

I |detD/(x)| dx = vol f(A) (2.4.6) 

J A 



A corollary of the area formula is the fact that bi-Lipschitz maps have Lusin prop- 
erty. 

Corollary 2.29 Let f : A C M — > N be a bi-Lipschitz function from a measurable 
subset A of a Carnot group M to another Carnot group N . If B C A is a set of 
measure then f{B) has measure. 



2.5 Rigidity phenomenon 

As an application to the previous sections we shall prove the following well known 
rigidity result. 

Theorem 2.30 Let X, Y be Lipschitz manifolds over the Carnot groups M, respectively 
N. If there is no subgroup of N isomorphic with M then there is no bi-Lipschitz em- 
bedding of X inY . 



Proof. Suppose that such an embedding exists. This implies (by choosing charts 
for X, Y) that there is a bi-Lipschitz embedding / : A C M — > N with A open. 
Rademacher theorem 2.21 and corollary 2.2S| imply that for almost any x £ A Df{ 



■T) 



exists and it is an injective morphism from M to N . This contradicts the hypothesis. 
■ 

For example there is no bi-Lipschitz function from an open set in a Heisenberg group 
to an Euclidean space R n . Otherwise stated, the Heisenberg group (or any other non- 
commutative Carnot group), is purely unrectifiable. This calls for an intrinsic definition 
of rectinability. The subject will be treated in the second part of these notes, where 
we shall propose the following point of view: rectinability is a theory of irreducible 
representations of the group of bi-Lipschitz homeomorphisms. 



Let us give another example of rigidity, connected to Sussmann [31], section 8, exam- 
ple of abundance of abnormal geodesies on four-dimensional sub-Riemannian manifolds. 

Consider a connected Lie group G with Lie algebra q generated by Xi, X2, X3, Xj, 
such that the following conditions hold: 

(i) X 3 = [X U X 2 ], X 4 = [XuXs], 

(ii) [X 2 ,X 3 ] = aXi + PX 2 + 7X3 such that (3^ 

Close examination shows that there is a three-dimensional family of Lie algebras sat- 
isfying these bracket conditions. Set D = span {Xi,X 2 } and endow G with the left 
invariant distribution generated by D and with the metric obtained by left translation of 
a metric on D such that X%, X 2 are orthonormal. Sussmann proves that t 1— > exp G (tX 2 ) 
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is an abnormal geodesic (see the second part of these notes or Sussmann H, or Mont- 
gomery [24] and the references therein). 



In this section we shall look to the nilpotentisation of the group G (section 4.1 
The algebra g admits the filtration 

V 1 = span {X U X 2 } C V 2 = span {X X ,X 2 ,X 3 } C g 

The nilpotentisation is q endowed with the following bracket: 

[Xi,X 2 ]n = X 3 , [Xi,X 3 ]n = X 4 

all the other brackets being zero. The dilatations in this Carnot algebra are 

S £ (y^ajXi) = ea\X\ + ea 2 X 2 + e 2 a 3 X 3 + e 3 a 4 X 4 

Denote by N the Carnot group which has the algebra (q, [•, -]n)- The group operation 
on N will be denoted by 

The linear group HL(N) is the class of linear invertible maps which commute 
with the dilatations and preserve the nilpotent bracket. We give without proof the 
description of this group (for a similar proof see proposition |3.l| ) . 



Proposition 2.31 In the basis {X\, X^} the linear group HL(N) is made by all 
linear transformations represented by the matrices of the form 

( a n \ 

021 a 22 

aua 22 

\ a\ x a 22 ) 

for all an, 022 7^ 0. 

Set H = span {Xi,X 3 ,X^}. This is a group isomorphic with the Heisenberg 
group H(l), with respect to the nilpotent bracket. Define now the family of horizontal 
curves 

c z {t) = z n (tX 2 ) 
for any z € H. Any such curve is transversal to H. 

Proposition 2.32 Let <j) : g — > g be a bi-Lipschitz map on the Carnot group N. Then 
for almost every z G H (with respect to the Lebesgue measure on H) there is z' € H 
such that 

<j>{c z {R)) C c z ,(R) 
Otherwise stated the family of curves c z is preserved by N bi-Lipschitz maps. 
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Proof. Let M be the family of all z E H such that </> is not a.e. differentiable in the 
points c z (t). Then M has to be negligible in H, otherwise we contradict Rademacher 
theorem 2.21| . Take now z E H \ M. Then for almost any t E R 4> is differentiable in 



c 2 (t) and (j>o c z is differentiable in t. From the chain rule it follows that 

l(^oc z )(t) = D<P(c z (t))j t c z (t) 



(all derivatives are Pansu). But 



±c z (t) = X 2 



and from proposition 2.31 it follows that 



■^(<f>oc z )(t) = X(z,t)X 2 

Indeed, D(p(x) E HL(N) and the direction X 2 is preserved by any linear F E HL{N). 
Because the curve o c 2 is almost everywhere tangent to X 2 it follows that it is a curve 
in the family c z ■ 



Let us turn back to theorem 2.30 and remark that it can be improved. With the 
notation used before H is a subgroup of N and H is isomorphic with the Heisenberg 
group H(l). Nevertheless the isomorphism does not commute with dilatations, or 
better, there is no isomorphism / : H(l) — > N commuting with dilatations: 

/oSfW = «5f 

Therefore there can be no bi-Lipschitz embedding of H (1) in N. This is because linear 
maps are not only group morphisms, but also commute with dilatations. 

Definition 2.33 Let N , M be Carnot groups. We write N < M if there is an injective 
group morphism f : N — > M which commutes with dilatations. 



Theorem 2.30 improves like this: 



Theorem 2.34 Let X, Y be Lipschitz manifolds over the Carnot groups M, respectively 
N. Lf N ji M then there is no bi-Lipschitz embedding of X inY . 

Rigidity in the sense of this section manifests in subtler ways. The purpose of Pansu 
paper was to extend a result of Mostow |22] , called Mostow rigidity. Although it is 
straightforward now to explain what Mostow rigidity means and how it can be proven, 
it is beyond the purposes of these notes. 



3 The Heisenberg group 

Let us rest a little bit and look closer to an example. The Heisenberg group is the 
most simple non commutative Carnot group. We shall apply the achievements of the 
previous chapter to this group. 
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3.1 The group 

The Heisenberg group H(n) = R 2n+1 is a 2-step nilpotent group with the operation: 

(x, x)(y, y) = (x + y,x + y + ^u(x, y)) 

where u is the standard symplectic form on R 2n . We shall identify the Lie algebra with 
the Lie group. The bracket is 

[(x,x), (y,y)] = (0,u(x,y)) 
The Heisenberg algebra is generated by 

V = R 2n x {0} 

and we have the relations V + [V, V) = H(n), {0} x R = [V, V) = Z(H(n)). 
The dilatations on H{n) are 

5 £ (x, x) = (ex, e 2 x) 

We shall denote by HL(H(n)) the group of invertible linear transformations and 
by SL(H(n)) the subgroup of volume preserving ones. 

Proposition 3.1 We have the isomorphisms 

HL(H(n)) w CSp{n) , SL{H{n)) w Sp(n) 



Proof. By direct computation. We are looking first for the algebra isomorphisms of 
H(n). Let the matrix 



represent such a morphism, with A € gl(2n,R), b, c <G R 2n and a <G R. The bracket 
preserving condition reads: for any (x,x), (y,y) € H(n) we have 

(0, uj(Ax + xb, Ay + yb)) = (u(x, y)b, auj(x, y)) 

We find therefore 6 = and u>(Ax,Ay) = au>(x,y), so A G CSp(n) and a > 0, a n = 
del A. 

The preservation of the grading gives c = 0. The volume preserving condition means 
a n+i _ i j ience a _ i anc [ ^4 e Sp(n). ■ 
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Get acquainted with Pansu differential Derivative of a curve: Let us see 

which are the smooth (i.e. derivable) curves. Consider c : [0, 1] — > H(n), t E (0, 1) and 
e > sufficiently small. Then the finite difference function associated to c, t, e is 

C £ (t){z) = S' 1 (ecity^t + ez)) 

After a short computation we obtain: 

cm) = ( c{t + £z) ~ c(t) , ~ c{t + e 4 ~ ~ c{t) - Uc(t), c(t + e 4 - c{t) )) 

When e^Owe see that the finite difference function converges if: 

t(t) = ^(c(t),6(t)) 

Hence the curve has to be horizontal; in this case we see that 

Dc(t)z = z(c(t),0) 

This is almost the tangent to the curve. The tangent is obtained by taking z = 1 and 
the left translation of Dc(t)l by c(t). 

The horizontality condition implies that, given a curve 1 1-> c(t) G R 2n , there is only 
one horizontal curve 1 1-> (c(t), c(t)), such that c(0) = 0. This curve is called the lift of 
c. 

Derivative of a functional: Take now / : H(n) — > i? and compute its Pansu 
derivative. The finite difference function is 

F e (x,x)(y,y) = (f(x + £y,x + ^u;(x,y)+£ 2 y)-f(x,x)^/£ 

Suppose that / is classically derivable. Then it is Pansu derivable and this derivative 
has the expression: 

df _ 1 Of 
Df{x,x)(y,y) = —(x,x)y+-u(x,y)—(x,x) 

Not any Pansu derivable functional is derivable in the classical sense. As an example, 
check that the square of any homogeneous norm is Pansu derivable everywhere, but 
not derivable everywhere in the classical sense. 

3.2 Lifts of symplectic diffeomorphisms 

In this section we are interested in the group of volume preserving diffeomorphisms 
of H(n), with certain classical regularity. We establish connections between volume 
preserving diffeomorphisms of H(n) and symplectomorphisms of R 2n . 



3 THE HEISENBERG GROUP 



36 



Volume preserving diffeomorphisms 

Definition 3.2 Dif f 2 (H(n),vol) is the group of volume preserving diffeomorphisms 
4> of H(n) such that <j> and it's inverse have (classical) regularity C 2 . In the same way 
we define Sympl 2 (R 2n ) to be the group of C 2 symplectomorphisms of R 2n . 

Theorem 3.3 We have the isomorphism of groups 

Diff 2 (H(n),vol) « Sympl 2 (R 2n ) x R 

given by the mapping 

/ = (/,/) G Diff 2 (H(n),vol) i ► (f £ Sympl 2 (R 2n ), /(0,0)) 

The inverse of this isomorphism has the expression 

(/ eSympl 2 (R 2n ),aeR) -» / = (/,/) G Diff 2 (H(n),vol) 

f{x,x) = (f(x), x + F(x)) 
where F(0) = a and dF = f*X - A. 

Proof. Let / = (/, /) : H(n) — > -ff(re) be an element of the group 
Diff 2 (H(n),vol). We shall compute: 

Df({x,x))(y,y) = lim^- 1 f (/(»>»)) f i(x, x)5 £ (y,y)) 

We know that Df(x,x) has to be a linear mapping. 

After a short computation we see that we have to pass to the limit e — > in the 
following expressions (representing the two components of Df((x,x))(y,y)): 

-if (x + ey, x + £ 2 y + ^w(x, y)) - f(x, x)) (3.2.1) 



e V \ a 2 

^2 (/ ( x + ^> ^ + e 2 2/ + |w(ac, y)) - /(x, x)- (3.2.2) 

i^ x ' ^ f { x + £ y^ + £2 y + f^fo 2/) 



The first component ( 3.2.1 ) tends to 



5/15/ 

— (x,x)y + - — (x,x)u;(x,y) 

The terms of order e must cancel in the second component ( |3.2.2j ). We obtain the can- 
cellation condition (we shall omit from now on the argument (x,x) from all functions): 

1 .df 1 , „ df , 1 . . . r df. df 

2 y) 5i " 2 W(/ ' c^ y) " 4 w(x ' yM/ ' 9i } + • V = ° (3 - 2 - 3) 
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The second component tends to 



dx 



y 



~v{f, -s=)y 

2 dx 



The group morphism Df(x,x) is represented by the matrix: 





df(x,x) 



lk + 2 % ® Jx 







<2l 



(3.2.4) 



We shall remember now that / is volume preserving. According to proposition 
means: 

lTx + 2dx-® Jx £Spin) 



Of 

dx 



1 df, 
2 W(/ '5i^ 



The cancellation condition ( |3.2.3| ) and relation ( ^.2.6| ) give 



dx' 



2 W(/ ' d-x V) 



this 
(3.2.5) 

(3.2.6) 
(3.2.7) 



These conditions describe completely the class of volume preserving diffeomor- 
phisms of H(n). Conditions ( |3.2.6| ) and Q3.2.7 ) are in fact differential equations for 
the function / when / is given. However, there is a compatibility condition in terms 
of f which has to be fulfilled for ( 3.2.7 ) to have a solution /. Let us look closer to 
( [3.2.7 ). We can see the symplectic form uj as a closed 2-form. Let A be a 1-form such 
that dX = uj. If we take the (regular) differential with respect to x in (|3.2.7 ) we quickly 
obtain the compatibility condition 



df 
dx 



G Sp(n) 



and ( |3,2.7| ) takes the form: 



df = f*\ 



A 



(3.2.8) 



(3.2.9) 



(all functions seen as functions of x only). 

Conditions ( |3.2.8 ) and ( 3.2.5[ ) imply: there is a scalar function fi = n(x,x) such 
that 

df T 



Let us see what we have until now: 



Tx € SP{U) 



(3.2.10) 
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= l + luf ^-) 

dx 2 ' dx 

df T 

d=x = » JX 



(3.2.12) 



(3.2.13) 



Now, differentiate ( 3.2.11 ) with respect to x and use ( |3.2.13| ). In the same time differ- 
entiate ( |3,2.12j ) with respect to x. Prom the equality 

d 2 f d 2 f 



dxdx dxdx 
we shall obtain by straightforward computation /x = 0. 



3.3 Hamilton's equations 

For any element <p £ Sympl 2 (R 2n ) we define the lift <j> to be the image of (</>, 0) by the 
isomorphism described in the theorem |3.3| . 

Let A C R 2n be a set. Sympl 2 (A) c is the group of symplectomorphisms with 
regularity C 2 , with compact support in A. 

Definition 3.4 For any flow t i— » G Sympl 2 (A) c denote by cj) h (-,x)) the horizontal 
flow in H{n) obtained by the lift of all curves 1 1— > 4>(t, x) and by <fi(-, t) the flow obtained 
by the lift of all (j) t - The vertical flow is defined by the expression 

= f!o/ (3.3.14) 



Relation ( 3.3.14 ) can be seen as Hamilton equation. 



Proposition 3.5 Let t € [0, 1] be a curve of diffeomorphisms of H(n) satisfying 

the equation: 

J t <f> v t(x,x) = (0,H(t,x)) , <j> v = id H[n) (3.3.15) 



Then the flow t i— > <pt which satisfies ( |3.3.14| ) and <pQ = id^n is the Hamiltonian flow 
generated by H . 

Conversely, for any Hamiltonian flow t i— > (j) t) generated by H, the vertical flow 



1 1— > (/>% satisfies the equation (|3.3.15|) 



Proof. Write the lifts 4>t and <f>\, compute then the differential of the quantity 
and show that it equals the differential of H. ■ 
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Flows of volume preserving diffeomorphisms We want to know if there is any 
nontrivial smooth (according to Pansu differentiability) flow of volume preserving dif- 
feomorphisms. 

Proposition 3.6 Suppose that 1 1— > (fit £ Diff 2 (H(n),vol) is a flow such that 

- is C 2 in the classical sense with respect to (x,t), 

- is horizontal, that is t i— » (j)t( x ) is a horizontal curve for any x. 
Then the flow is constant. 



Proof. By direct computation, involving second order derivatives. Indeed, let <fit( 
((pt(x),x + Ft{x)). From the condition (fit € Diff 2 (H(n),vol) we obtain 

= )p(<fi t (x),^(x)y) - ~^w{x,y) 

and from the hypothesis that t *— > (fit(x) is a horizontal curve for any x we get 

dF 1 

Equal now the derivative of the RHS of the first relation with respect to t with the 
derivative of the RHS of the second relation with respect to x. We get the equality, for 
any y € R 2n : 

= ~u(^(x)y,<fi t (x)) 

therefore (fit{x) =0. ■ 

One should expect such a result to be true, based on two remarks. The first, 
general remark: take a flow of left translations in a Carnot group, that is a flow t i— > 
4>t{x) = xtx. We can see directly that each <j> t is smooth, because the distribution is left 
invariant. But the flow is not horizontal, because the distribution is not right invariant. 
The second, particular remark: any flow which satisfies the hypothesis of proposition 
|3.6| corresponds to a Hamiltonian flow with null Hamiltonian function, hence the flow 
is constant. 

At a first glance it is disappointing to see that the group of volume preserving 
diffeomorphisms contains no smooth paths according to the intrinsic calculus on Carnot 
groups. But this makes the richness of such groups of homeomorphisms, as we shall 
sec. 



3.4 Volume preserving bi-Lipschitz maps 

We shall work with the following groups of homeomorphisms. 

Definition 3.7 The group Hom(H(n),vol, Lip) is formed by all locally bi-Lipschitz, 
volume preserving homeomorphisms of H(n), which have the form: 

4>(x,x) = (<fi(x), x + F(x)) 
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The group Sympl(R 2n , Lip) of locally bi-Lipschitz symplectomorphisms of R 2n , in 
the sense that for a.e. x G R 2n the derivative D(j>(x) (which exists by classical Rademacher 
theorem) is symplectic. 

Given A C R 2n , we denote by Hom(H(n),vol, Lip)(A) the group of maps <j> which 
belong to Hom(H(n),vol,Lip) such that <p has compact support in A (i.e. it differs 
from identity on a compact set relative to A). 

The group Sympl(R 2n , Lip)(A) is defined in an analogous way. 

Remark that any element <p £ Hom(H(n),vol,Lip) preserves the "vertical" left 
invariant distribution 

(xx) i— > (x,x)Z(H(n)) 

for a.e. (x,x) £ H(n). 

We shall prove that any locally bi-Lipschitz volume preserving homeomorphism of 
H(n) belongs to Hom(H(n),vol, Lip). 

Theorem 3.8 Take any (j) locally bi-Lipschitz volume preserving homeomorphism of 
H(n). Then (ft has the form: 

(j)(x,x) = (4>(x),x + F(x)) 

Moreover 

4> € Sympl(R 2n , Lip) 
F : R 2n — > R is Lipschitz and for almost any point (x, x) € H(n) we have: 

DF(x)y = ^uj(<p(x),D(f)(x)y) - ~u(x,y) 



Proof. Set cj){x) = ((j)(x), (f)(x)). Also denote by 

I (y,y) I 2 = I y I 2 + I y 



By theorem [2.21 <p is almost everywhere derivable and the derivative can be written 



in the particular form: 

(AO 

V o 1 

with A = A(x,x) &Sp(n,R). 

For a.e. x G IR 2n the function (ft is derivable. The derivative has the form: 

D4>(x,x)(y,y) = (Ay,y) 

where by definition 

D(p(x,x)(y,y) = lim5~ 1 [4>(x,x)~ 1 (j)((x,x)5 e (y,y)) 
Let us write down what Pansu derivability means. 
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For the H. component we have: the limit 

lim | -[(/)((x,x)6 £ (y,y))-</)(x,x)} - A \ = (3.4.16) 

e— >0 £ 

is uniform with respect to y € -8(0, 1). We shall rescale all for the ball -B(0, R) £ H(n). 
Relation ( p.4.16|) means: for any A > there exists eq(\) > such that for any R > 
and any e > £q/ R and y £ H(n) such that | y |< R we have 

| i (0(x5 e y) - 0(x)) - ,4y | < XR (3.4.17) 

Suppose that x = (x,x) satisfies \ x \< R with R > 1. Fix now A > and consider 
arbitrary y G i? 2n , | y |< 1 and /i > 0. Set 

There exists a constant C > such that 

\(y,y,\ 2 < i +-(i + ii) 2 



For e' < Eq(\) choose 



fi = i( V / C 2 (l + i?) 4 + 4e' C(l + R) 2 



1/2 



Then the inequality ( 3.4.17 ) is true for R replaced by ^1 + ^(1 + R) 2 J , y = (y, y^), 

with arbitrary | y \< 1. We get: for any A > there is e(A) = eo(X)/2 > such that 
for any e < e(X) and any | y |< 1 we have 

| - (<p(x + ey, x) — <f)(x,x)) — Ay \ < 2A 

Therefore x i— > 4>(x, x) is derivable a.e. with (locally) bounded derivative. It is therefore 
(locally) Lipschitz. 

Let us now return to ( 3.4.16j ) and choose y = 0. We get: the map 

y ^ 4>(x + y,x + -u(x,y)) 

is derivable in y = and the derivative equals A. But we have just proven that the 
derivative of (ft(-,x) in x equals A. We get: <p is constant with respect to x. 
We have proven until now that 

4>(x,x) = (cf)(x),(f)(x,x)) 

with 4> £ Sympl(R 2n , Lip). 

Let us use this information in the definition of the center component of the Pansu 
derivative. We have: 

| 4, U{xS e y) - 4>{x) - \u{<t>{x), 4>{x + ey)) \ - y- (3.4.18) 
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--u(-(<l)(x + ey) - </>(x)),Ay) | 

as e — > 0, uniformly with respect to | y |< 1. Prom previously proved facts we obtain 
that ( 3.4.18() is equivalent to 

I ^2 y>{x8 e y) ~ 4>(x) - ~w(<l)(x), 4>(x + ey) - <f>(x)) \ - y \ -> 

as e — > 0, uniformly with respect to y. We quickly get that <j>{x,x) = x + F(x). 
All in all the Pansu derivability (center component) reads 

lim | -[F(x + ey)-F(x)] + — u(x,y) - -u(<t)(x), ^(x+ey) - <l>(x)]) \= (3.4.19) 
e^o e z 2e 2 e z 

This means that for almost any x G M 2ri 

1 the function <j> is derivable and the derivative is equal to A = A(x), 

2 the function F is derivable and is connected to <f> by the relation from the con- 
clusion of the proposition. 

We have seen that (ft is locally Lipschitz implies that x E M? n i— > A[x) is locally 
bounded, therefore <j> is locally Lipschitz. In the same way we obtain that F is locally 
Lipschitz. ■ 

The flows in the group Hom(H(n),vol, Lip) are defined further. 

Definition 3.9 A flow in the group Hom(H(n),vol, Lip) is a curve 

i i — > (f>t G Hom(H(n),vol, Lip) 

such that for a.e. x £ i? 2n the curve t i— > 0t(x) £ i? 2n zs (locally) Lipschitz. 

For any flow we can define the horizontal lift of this flow like this: a.e. curve 
t i— > (/>t(x) Zi/ts to a horizontal curve 1 1— > <p+(x, 0). Define then 

$(x,x) = tf(x,0)(0,x) 



If the flow is smooth (in the classical sense) and ^ £ Dif f 2 (H(n),vol) then this 
lift is the same as the one described in definition 13.41. We can define now the vertical 



flow by the formula ( 3.3.14 ), that is 



There is an analog of proposition 3.6. In the proof we shall need lemma 3.11, which 
comes after. 



Proposition 3.10 Lett i— ► £ Hom(H(n),vol, Lip) be a curve such that the function 
t i— > <&(x,t) = (<pt(x),t) is locally Lipschitz from H(n) x R to itself. Then t i— » <f>t is a 
constant curve. 
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Proof. By Rademacher theorem |2.15 for the group H(n) x R we obtain that <I> is 



almost everywhere derivable. Use now lemma 3.11 to deduce the claim. ■ 

A short preparation is needed in order to state the lemma 3.11. Let N be a non- 
commutative Carnot group. We shall look at the group N xR with the group operation 
defined component wise. This is also a Carnot group. Indeed, consider the family of 
dilatations 

S E (x,t) = (5 £ {x),et) 

which gives to N x R the structure of a CC group. The left invariant distribution on 
the group which generates the distance is (the left translation of) W\ = V\ X R. 

Lemma 3.11 Let N be a noncommutative Carnot group which admits the orthogonal 
decomposition 

N = Vi + [N,N] 

and satisfies the condition 

VinZ(N) = 
The group of linear transformations of N x R is then 

HL(N xR ) = \( ^ ° d ) HL(N) , ceVi , d£R 



Proof. We shall proceed as in the proof of proposition 3J.. We are looking first at 
the Lie algebra isomorphisms of N x R, with general form 

/ A b 
\ c d 

We obtain the conditions: 

(i) c orthogonal on [N, N] , 

(ii) b commutes with the image of A: [b,Ay] = 0, for any y € N, 

(iii) A is an algebra isomorphism of N. 

From (ii), (iii) we deduce that b is in the center of iV and from (i) we see that c € V\. 
We want now the isomorphism to commute with dilatations. This condition gives: 

(iv) b G V u 

(v) A commutes with the dilatations of N. 

(iii) and (v) imply that A £ HL(N) and (iv) that b = 0. ■ 
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Hamiltonian diffeomorphisms: more structure In this section we look closer 
to the structure of the group of volume preserving homeomorphisms of the Heisenberg 
group. 

Let Hom h (H(n),vol, Lip) (A) be the group of time one homeomorphisms t i— > <fi h , 
for all curves t \— * fa G Sympl(R 2n , Lip) (A) such that (x,t) i— > (fa(x),t) is locally 
Lipschitz. 

The elements of this group are also volume preserving, but they are not smooth 
with respect to the Pansu derivative. 

Definition 3.12 The group Hom(H(n),vol)(A) contains all maps <fi which have a.e. 
the form: 

4>{x,x) = ((j>(x),x + F(x)) 

where <fi € Sympl(R 2n , Lip) (A) and F : R 2n — » R is locally Lipschitz and constant 
outside a compact set included in the closure of A (for short: with compact support in 
A). 

This group contains three privileged subgroups: 

- Hom(H(n),vol, Lip)(A), 

- Hom h (H(n),vol,Lip)(A) 

- and Horn" (H(n),vol, Lip) (A). 

The last is the group of vertical homeomorphisms, any of which has the form: 

(fi"(x,x) = (x,x + F(x)) 

with F locally Lipschitz, with compact support in A. 

Take a one parameter subgroup t i— ► fa £ Hom(H(n),vol, Lip) (A), in the sense 
of the definition We know that it cannot be smooth as a curve in in the group 
Hom(H(n),vol,Lip)(A), but we also know that there are vertical and horizontal flows 
1 1 — * 0£ , $ such that we have the decomposition fa o ^ = Unfortunately none 
of the flows 1 1— ► 0£ , 4>t are one parameter groups. 

There is more structure here that it seems. Consider the class 

HAM(H(n))(A) = Hom(H(n),vol, Lip)(A) x Horn" {H{n),vol, Lip){A) 

For any pair in this class we shall use the notation (cj),(p v ) This class forms a group 
with the (semidirect product) operation: 

(fa<f> v )$,lf> V ) = (^^fo^ofof 1 ) 

Proposition 3.13 If t i— > fa € Hom(H(n),vol, Lip) (A) is an one parameter group 
then t i — > (fa, <j>t) £ HAM(H(n)) is an one parameter group. 
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Proof. The check is left to the reader. Use definition and proposition 1, chapter 5, 
Hofer & Zehnder jl6|, page 144. ■ 

We can (indirectly) put a distribution on the group HAM(H(n))(A) by specifying 
the class of horizontal curves. Such a curve t > (4>t, <t>t) m the group HAM(H(n))(A) 
projects in the first component t fa to a flow in Hom(H(n),vol, Lip){A). Define for 
any (4>,(p v ) € HAM(H(n))(A) the function 

(f) h = 0O(j) V 

and say that the curve 

i £ HAM(H(n))(A) 

is horizontal if 

t 4>t{x,x) 

is horizontal for any (x, x) £ H(n). 

We introduce the following length function for horizontal curves: 

= j \\4> v t \\ L ~{A) dt 

With the help of the length we are able to endow the group HAM(H(n))(A) as a path 
metric space. The distance is defined by: 

over all horizontal curves t i— > (0f , <j>^) such that 

(<Ao,^) = (j>iA v i) 
$i,ct>l) = {hi Ah) 

The distance is not defined for any two points in HAM(H(n))(A). In principle the 
distance can be degenerated. 

The group HAM(H(n))(A) acts on AZ (where Z is the center of H(n)) by: 

{(t),4> v ){x,x) = 4> v o 4>(x,x) 

Remark that the group 

Horn 1 ' (H(n), vol, Lip)(A) = {id} x Hom v (H(n),vol, Lip)(A) 

is normal in HAM(H(n))(A). We denote by 

HAM(H(n))(A) /Horn" (H(n) , vol, Lip) (A) 

the factor group. This is isomorphic with the group Sympl(R 2n , Lip) (A) . 

Consider now the natural action of Horn" (H{n), vol, Lip){A) on AZ. The space of 
orbits AZ/Hom"{H{n),vol,Lip){A) is nothing but A, with the Euclidean metric. 
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Factorize now the action of HAM(H(n))(A) on AZ, by the group 

Hom v (H(n),vol, Lip)(A) 

We obtain the action of HAM(H(n))(A)/Hom v (H(n),vol,Lip)(A) on 

AZ/Hom v (H(n),vol, Lip)(A) 

Elementary examination shows this is the action of the symplectomorphisms group on 
R 2n . 

All in all we have the following (for the definition of the Hofer distance see next 
section): 

Theorem 3.14 The action of HAM{H{n)){A) on AZ descends after reduction with 
the group Hom v (H(n),vol, Lip) to the action of symplectomorphisms group with com- 
pact support in A. 

The distance dist on HAM(H(n)) descends to the Hofer distance on the connected 
component of identity of the symplectomorphisms group. 



3.5 Symplectomorphisms, capacities and Hofer distance 

Symplectic capacities are invariants under the action of the symplectomorphisms group. 
Hofer geometry is the geometry of the group of Hamiltonian diffeomorphisms, with 
respect to the Hofer distance. For an introduction into the subject see Hofer, Zehnder 
fil| chapters 2,3 and 5, and Polterovich [28|, chapters 1,2. 

A symplectic capacity is a map which associates to any symplectic manifold (M, w) 
a number c(M,uj) € [0,+oo]. Symplectic capacities are special cases of conformal 
symplectic invariants, described by: 

Al. Monotonicity: c(M, to) < c(N,r) if there is a symplectic embedding from M to 

N, 

A2. Conformality: c(M, euj) =\ e \ c(M,u>) for any a € R, a ^ 0. 

We can see a conformal symplectic invariant from another point of view. Take a 
symplectic manifold (M, u) and consider the invariant defined over the class of Borel 
sets B(M), (seen as embedded submanifolds). In the particular case of R 2n with the 
standard symplectic form, an invariant is a function c : B(R 2n ) — > [0, +oo] such that: 

Bl. Monotonicity: c(M) < c(N) if there is a symplectomorphism (j) such that (j)(M) C 
N, 

B2. Conformality: c(eM) = e 2 c(M) for any e G R. 

An invariant is nontrivial if it takes finite values on sets with infinite volume, like 
cylinders: 

Z(R) = {x£ R 2n : x\ + xl<R} 
There exist highly nontrivial invariants, as the following theorem shows: 
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Theorem 3.15 (Gromov's squeezing theorem) The ball B(r) can be symplectically em- 
bedded in the cylinder Z{R) if and only if r < R. 

This theorem permits to define the invariant: 

c(A) = sup{R 2 : 3(p(B(R)) C A} 

called Gromov's capacity. 

Another important invariant is Hofer-Zehnder capacity. In order to introduce this 
we need the notion of a Hamiltonian flow. 

A flow of symplectomorphisms t i— > is Hamiltonian if there is a function H : 
M x R — > R such that for any time t and place x we have 

io((pt(x),v) = dH((p t (x),t)v 

for any v G T Mx) M. 

Let H(R 2n ) be the set of compactly supported Hamiltonians. Given a set A C R 2n , 
the class of admissible Hamiltonians is H(A), made by all compactly supported maps 
in A such that the generated Hamiltonian flow does not have closed orbits of periods 
smaller than 1. Then the Hofer-Zehnder capacity is defined by: 

hz(A) = sup{||#||oo : H G H(A)} 

Let us denote by Ham(A) the class of Hamiltonian diffeomorphisms compactly 
supported in A. A Hamiltonian diffeomorphism is the time one value of a Hamiltonian 
flow. In the case which interest us, that is R 2n , Ham(A) is the connected component 
of the identity in the group of compactly supported symplectomorphisms. 

A curve of Hamiltonian diffeomorphisms (with compact support) is a Hamiltonian 
flow. For any such curve t i— > c(t) we shall denote by t i— > H c (t, •) the associated 
Hamiltonian function (with compact support). 

On the group of Hamiltonian diffeomorphisms there is a bi-invariant distance intro- 
duced by Hofer. This is given by the expression: 

d H (<f>,il>) = inf {_^ \\ H c(t)\\oo,B?" dt : c: [0, 1] ^ Ham{R 2n )^ (3.5.20) 

It is easy to check that d is indeed bi-invariant and it satisfies the triangle property. 
It is a deep result that d is non-degenerate, that is d(id, 4>) = implies <p = id. 

With the help of the Hofer distance one can define another symplectic invariant, 
called displacement energy. For a set A C R 2n the displacement energy is: 

de{A) = inf {d H (id, 0) : G Ham{R 2n ) , <f)(A) n A = 0} 

A displacement energy can be associated to any group of transformations endowed 
with a bi-invariant distance (see Eliashberg & Polterovich Q, section 1.3). The fact that 
the displacement energy is a nontrivial invariant is equivalent with the non-degeneracy 
of the Hofer distance. Section 2. Hofer & Zehnder |l(| is dedicated to the implications 
of the existence of a non-trivial capacity. All in all the non-degeneracy of the Hofer 
distance (proved for example in the Section 5. Hofer &: Zehnder [jl6|) is the cornerstone 
of symplectic rigidity theory. 
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3.6 Hausdorff dimension and Hofer distance 

We shall give in this section a metric proof of the non-degeneracy of the Hofer distance. 

A flow of symplectomorphisms t h- * fa with compact support in A is Hamiltonian 
if it is the projection onto R 2n of a flow in Hom(H(n),vol, Lip) (A). 

Consider such a flow which joins identity id with <f>. Take the lift of the flow 
t G [0, 1] fa G Hom(H(n),vol, Lip) (A). 

Proposition 3.16 The curve 1 1— > fa(x, 0) has Hausdorff dimension 2 and measure 

H 2 (t^fa{x,x)) = J \H t (fa{x))\ dt (3.6.21) 



Proof. The curve is not horizontal. The tangent has a vertical part (equal to the 
Hamiltonian). Use the definition of the (spherical) Hausdorff measure H 2 and the 
Ball-Box theorem 2J} to obtain the formula ( |3.6.21[) . ■ 

We shall prove now that the Hofer distance (3.5.20) is non-degenerate. For given 
(f> with compact support in A and generating function F, look at the one parameter 
family: 

4> a (x,x) = {<t>(x),x + F(x) + a) 
The volume of the cylinder 

{(x, z) : x G A, z between and F{x) + a) 

attains the minimum V(4>, A) for an ao € R. 

Given an arbitrary flow t G [0, 1] i— ► fa G Hom(H(n),vol, Lip) (A) such that 0o = 
id and fa = 4> a , we have the following inequality: 

V(faA) < J H 2 {(t^ Mx,0)j dx (3.6.22) 

Indeed, the family of curves t *— > fa(x, 0) provides a foliation of the set 

{fa(x,0) : xGi} 

The volume of this set is lesser than the RHS of inequality ( 3.6.22 ): 

C Vol ^fa(x,0) : x e A^ < J H 2 (ti-> fa(x,ofj dx (3.6.23) 

where C > is an universal positive constant (that is it depends only on the algebraic 
structure of H(n)). This is a statement which is local in nature; it is simply saying that 
the area of the parallelogram defined by the vectors a, b is smaller than the product 
of the norms ||a|[||b||. We give further a proof of this inequality. 

Let A C H(n) be a set with finite Hausdorff measure Q = 2n + 2 which contains a 
set B = B x {0} such that there is a flow 

t G [0, l]^fa: H(n) -» H(n) 



with the properties: 
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(a) (po(x) = x for any x £ B, 

(b) for almost any t <pt preserves the Hausdorff measure Q, 

(c) for any t € [0, 1] 4>t is bi-Lipschitz and the Lipschitz constants of (fit, (ft^ 1 are 
upper bounded by a positive constant Const, 

(d) A = {Mx) :t€[0,l], x € B}, 

(e) for Ti. 2n almost any x G B the fiber f x = {</>t(x) : t € [0, 1]} has finite Hausdorff 
dimension P = 2. 



Then 



c < f n 2 (f x ) d( x ) 

Jb 

Indeed, we start from the isodiametric inequality in the Heisenberg group: 

(diam L)) Q > cH Q {D) 



We can rescale the Hausdorff measure Q in order to have c = 1. Consider also the 
projection ir : H(n) — > R 2n , tt(x,x) = x. We can rescale the Lebesgue measure on 
R 2n such that the volume of the projection of the ball B cc (x, 1) equals 1. Here we 
have used the notation: B cc \x, R) is the ball in the Carnot-Caratheodory distance in 
H(n), and B E (x,R) is the Euclidean ball of radius R in R 2n . 

Fix e small and cover B with Euclidean balls of radius e B E (xj,e), j € J e , in an 
efficient way, i.e. such that J £ e 2n tends to vol(B) when e — ► 0. We want now to cover 
A with sets of the form 

4> tk (B cc (( Xj ,0),e)) 

where k depends on Xj. How many sets do we need? 

From the isodiametric inequality and the fact that <pt is volume preserving, we see 
that 

diam (f> t (B cc {(xj, 0), e)) > e 

Put this together with the fact that the fiber f x . has Hausdorff dimension P = 2 
and get that we need about H p (f Xj ) / e p such sets to cover the fiber f Xj . The sum 
of volumes of the cover is greater then the volume of A. Tend e to and obtain the 
claimed inequality ( 3.6.23| ). The constant C comes from the rescalings of the measures. 



Without rescalling we have: 

vol(B cc (0,1)) 



C 



vol{B E (0,l) 



We go back now to the main stream of the proof and remark that by continuity of 
the flow the volume of the set 



jc/> 4 (x,0) : x E -A | 



is greater than V(4>,A). 
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For any curve of the flow we have the uniform obvious estimate: 

H 2 ((t^Mx,0)) < J \\H t {<ft t {x))\\ At00 dt (3.6.24) 

Put together inequalities ( 3.6.22| ), ( |3.6.24| ) and get 

CV(<j>,A) < f \\H t (<ft t (x))\\ Ai00 dt (3.6.25) 
J o 

Use the definition of the Hofer distance ( |3.5.20| ) to obtain the inequality 

CV(<f>,A) < vol(A) d H {id,4>) (3.6.26) 

This proves the non-degeneracy of the Hofer distance, because if the RHS of (|3.6.26| ) 
equals then V((ft, A) is 0, which means that the generating function of (ft is almost 
everywhere constant, therefore (ft is the identity everywhere in R 2n . 

We close with the translation of the inequality (3.6.25) in symplectic terms. 

Proposition 3.17 Let (ft be a Hamiltonian diffeomorphism with compact support in 
A and F its generating function, that is dF = (ft*\ — A, where dX = u. Consider 
a Hamiltonian flow t i— > H t , with compact support in A, such that the time one map 
equals (ft. Then the following inequality holds: 

C inf I J \F(x)-c\ cfc.-ceflj < vol(A) J WH^a dt 

with C > an universal constant equal to 

vol(B cc (0A)) 
vol{B E {0A) 

3.7 Invariants of volume preserving maps 

Theorem |3.8| gives the strucure of a volume preserving locally bi-Lipschitz homeo- 
morphism. We have denoted by Hom(H(n),vol, Lip) the class of these maps. Any 
(ft E Hom(H(n),vol,Lip) can be written as 

(ft(x,x) = ((ft(x), x + F(x)) 

where (ft E Sympl(R 2n , Lip) is a locally bi-Lipschitz symplectomorphism of R 2n . For a 
set B C R 2n we denote by vol(B) it's Lebesgue measure. For a set C C R 1(C) will 
be it's length (one dimensional Lebesgue measure). An immediate consequence of the 
structure theorem ^8 is given further. 

Corollary 3.18 Let A C H(n) be a H 2n+2 measurable set such that U 2n (A) < 
+oo, where A denotes the orthogonal projection of A on R 2n . Then for any (ft E 
Hom(H(n),vol, Lip) we have: 

H 2n+2 (A) H 2n+2 $(A)) 
W{ ' ~ vol(A) ~ vol{(ft(A)) 
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We call w(A) the width of A. 

More general, for any i > 1 define the i-height of A to be 

Thenhi(A) = hi(<j>(A)). 

Proof. Any symplectomorphism is volume preserving. The thesis follows from this 
and the structure theorem ■ 

Suppose A is open, bounded and classically smooth. In the case of the Heisenberg 
group H(i) Pansu [|?J proved the following isoperimetric inequality: 

n A {A) < c(n 3 (dA 

The isoperimetric inequality is a very important subject in analysis in metric spaces. 
It has been intensively studied in it's disguised form (Sobolev inequalities) in many 
papers. To have a grasp of what is hidden here the reader can consult Varopoulos 
& al. Hajlasz, Koskela [14| and Garofalo, Nhieu []n|. In order to give a decent 



exposition of this subject we would need a good notion of perimeter. In this first part 
we don't enter much into the measure theoretic aspects though. 

Let us work with objects which have classical regularity, as in Gromov [12] section 



2.3. There we find a more general isoperimetric inequality, true in particular in any 
Carnot group, provided that A is open, bounded, with (classically) smooth boundary. 
If we denote by Q the homogeneous dimension of the Carnot group N (which equals 
its Hausdorff dimension) then Gromov's isoperimetric inequality is: 

Q 

n Q (A) < c(n Q -\dAy 



Recall that for Heisenberg group H(n) the homogeneous dimension is Q = 2n + 2. 

We should use further the group Dif f 2 (H(n),vol) instead of Hom{H(n), vol, Lip), 
because we work with classical regularity . We kindly ask the reader to believe that a 
good definition of the perimeter of the set A allows us to use the group Hom(H(n),vol, Lip). 
We shall denote the perimeter function by Per. In "smooth" situations Per {d A) = 
H Q ~ l (dA). 

We can define then the isoperimeter of A to be: 



isop(A) 2n+1 = inf 



_ „ \ 2n+2 

Per d(j>(A)\ 
(H 2n (A)) 



2n+l 



with respect to all <f> E Hom(H(n),vol, Lip). This is another invariant of the group of 
volume preserving locally bi-Lipschitz maps. 

Yet another invariant is the isodiameter. This is defined as 

. - , (diam 4>{A)) 2n+2 
zsod(A) = inf^ 



4 SUB-RIEMANNIAN LIE GROUPS 



52 



with respect to all <fi £ Hom(H(n),vol, Lip). Because the diameter of A is greater than 
the square root of the oo height of A, it follows that 

. ~ h 00 (A) n+1 w(A) 
isod(A) > = 

v " n 2n+2 (A) 

All these invariants scale with dilatations like this (denote generically by inv any 
of the mentioned invariants): 

inv 5 £ A = e 2 A 

The groupoid 7iom{H{n),vol, Lip) with objects open sets H (n) and arrows volume 
preserving locally bi-Lipschitz maps between then, can be considered instead of the 
group Hom(H(n),vol, Lip). Then the heights are no longer invariants. Indeed, set A 
to be the disjoint union of two isometric sets. Then we can keep one of these sets fixed 
and put on the top of it the other one, using only translations. The width will change. 
We can define other invariants though, using the heights and the number of connected 
components, for example. 

By using lifts of symplectic flows one can define symplectic invariants. Conversely, 
with any symplectic capacity comes an invariant of Hom(H(n),vol,Lip). Indeed, let c 
be a capacity. Then c(A) = c{A) is an invariant. 

Is there any general definition of a Hom(H(n),vol, Lip) invariant which has as 
particular realization heights and symplectic capacities? 

How about the invariants of the group Hom(N,vol, Lip), where N is a general 
Carnot group? We leave this for further study. 

4 Sub-Riemannian Lie groups 



In this section we shall look closer to the last example of section L5 . Let G be a real 
connected Lie group with Lie algebra q and Dcga vector space which generates the 
algebra. This means that the sequence 

Vi = D , V i+1 = V i + [D, V*] 

provides a filtration of q: 

V 1 C V 2 C ... V m = q (4.0.1) 

4.1 Nilpotentisation 

The filtration has the straightforward property: if x € V % and y € V 3 then [x, y] € V t+3 
(where V k = g for all k > m and V° = {0}). This allows to construct the Lie algebra 

n(g,D) = ®T=iVi , Vi = V'/V'- 1 

with Lie bracket [x, y] n = [x, y], where x = x + V 1 ' 1 if x G V 1 \ V 1 " 1 . 



Proposition 4.1 n(g,D) with distinguished space V\ = D is the Lie algebra of a 
Carnot group of step m. It is called the nilpotentisation of the filtration ( 4.0.1 ) . 
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Remark that n(g, D) and g have the same dimension, therefore they are isomorphic as 
vector spaces. 

We set the distribution induced by D to be 

D x = TL X D 

where x £ G is arbitrary and L x : G — > G, L x y = xy is the left translation by x. We 
shall use the same notation D for the induced distribution. 

Let {Xi, X p } be a basis of the vector space D. We shall build a basis of g which 
will give a vector spaces isomorphism with n(g, D). 

A word with letters A = {X±, ...,X p } is a string X h (iy..X h ^ where h : {1, s} — > 
{1, ...,p}. The set of words forms the dictionary Diet (A), ordered lexicographically. We 
set the function Bracket : Dict(A) — > g to be 

Bracket(X h{1) ...X h ( s) ) = [X h[1) ,[X h{2 ),[-; x h(s)}--} 

For any x £ Bracket{Dict{A)) let rr € Dict(A) be the least word such that Bracket(x) = 
x. The collection of all these words is denoted by g. 

The length l(x) = length(x) is well defined. The dictionary Dict(A) admit a 
filtration made by the length of words function. In the same way the function I gives 
the filtration 

V 1 n Bracket(Dict(A)) C V 2 n Bracket(Dict(A)) C ... Bracket{Dict(A)) 

Choose now, in the lexicographic order in g, a set B such that B = Bracket(B) is a 
basis for g. Any element X in this basis can be written as 

X = [ X h{l)i[ X h{2),[—i X h(s)}—] 

such that l(X) = s or equivalently X G V s \ V 13 ^ 1 . 
It is obvious that the map 

X € B ^X = I + V'W^en^D) 

is a bijection and that B = |Xj : j = l,...,dim g| is a basis for n(g,D). We can 

identify then g with n(g, D) by the identification Xj = Xj. 

Equivalently we can define the nilpotent Lie bracket [•, -] n directly on g, with the 
use of the dilatations on n(g,D). 

Instead of the filtration ( |4 . . l| ) let us start with a direct sum decomposition of g 

g = ®f =1 Wi , V i = ®i =1 Wj (4.1.2) 

such that [V\V j ] C V i+j . The chain V i form a filtration like (pUl) . 
We set 

m 

S £ (x) = ^e'xi 

i=l 

for any e > and x £ g, which decomposes according to 



LL2) as x = Yli 
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Proposition 4.2 The limit 



[x,y] n = lhn5 £ 1 [6 £ x,5 £ y] (4.1.3) 



exists for any x,y E g and (g, [•, -] n ) is the Lie algebra of a Carnot group with dilatations 

s £ . 

Proof. Let x = Yl)=i x j an d V = Ylk=iVk- Then 

i+l s 

[S £ x,5-ey] = ^ ^Z^'^p 

s=2 j+fc=sp=l 

We apply J^ 1 to this equality and we obtain: 

i+l s 
s=2 j+k=s p=l 

When e tends to the expression converges to the limit 

i+l 

i x ,y]n = Y ^2 \. x ^yk\s ( 4 -!.4) 

s=2 j+k=s 

For any e > the expression 

[x,y] £ = ^[SeX^-sy] 

is a Lie bracket (bilinear, antisymmetric and it satisfies the Jacobi identity). Therefore 
at the limit e — > we get a Lie bracket. Moreover, it is straightforward to see from the 
definition of [x, y] n that 5 £ is an algebra isomorphism. We conclude that (g, *]n) is 
the Lie algebra of a Carnot group with dilatations 6 £ . ■ 

Proposition 4.3 Let {Xi, Xdim s } be a basis of g constructed from a basis {X±, ...,X P } 
ofD. Set 

Wj = span {X, : 1{X % ) = j} 

Then the spaces Wj provides a direct sum decomposition ( |4.1,2| ) . Moreover the identi- 
fication Xi = Xi gives a Lie algebra isomorphism between (g, [•, -] n ) and n(g,D). 

Proof. In the basis {X±, X^im s } the Lie bracket on g looks like this: 

where Cijk = if l(X{) + l(Xj) < l(Xk)- From here the first part of the proposition 
is straightforward. The expression of the Lie bracket generated by the decomposition 
( f4.1.2|) is obtained from (|4.1.4|) ). We have 
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where \n-j = 1 if £(Xj) + l(Xj) = l{Xk) and otherwise. The Lie algebra isomorphism 
follows from the expression of the Lie bracket on xi{%,D): 

[x h xA = [x h xA + vmy+m-i 

■ 

In conclusion the expression of the nilpotent Lie bracket depends on the choice of 
basis B trough the transport of the dilatations group from n(g,D) to q. 

Let N(G,D) be the simply connected Lie group with Lie algebra n(g,D). As 
previously, we identify N(G, D) with n(g, D) by the exponential map. 



4.2 Commutative smoothness for uniform groups 



This section is just a sketch and certainly needs to be improved. It is inspired from 
Bellaiche ||, last section. I think that by improving the notions explained here and 



by making axioms from the conclusions of lemma 4.13 and Ball-Box theorem 4. IS, 
then it is possible to avoid the use of the notion of Lie group. For example, one could 



use consequences of proposition 4.7 in order to construct a general manifold structure, 



noncommutative derivative, moment map, as in sections |4.3] , |4.5| , [4.7 . 

We start with the following setting: G is a topological group endowed with an uni- 
formity such that the operation is uniformly continuous. More specifically, we introduce 
first the double of G, as the group G^ = G x G with operation 

(x,u){y,v) = (xy,y~ l uyv) 

The operation on the group G, seen as the function 

op : G^ — > G , op(x, y) = xy 

is a group morphism. Also the inclusions: 

i' :G-» G< 2 > , i'{x) = (x,e) 

i" : G^ G (2) , i"(x) = (x,^ 1 ) 

are group morphisms. 



Definition 4.4 G is an uniform group if we have two uniformity structures, on G and 
G 2 , such that op, i' , i" are uniformly continuous. 

A local action of a uniform group G on a uniform pointed space (X, xq) is a function 
<j) € W G V(e) i — > <f> : 6 V(xq) — > V$ G V(xo) such that the map (0, x) i— > <p(x) is 
uniformly continuous from G x X (with product uniformity) to to X. Moreover, the 
action has the property: for any 4>,tp € G there is D G V(xq) such that for any x £ D 
(j)il)~ l {x) and cf){il)~ l {x)) make sense and ^^{x) = 

Finally, a local group is an uniform space G with an operation defined in a neigh- 
bourhood of (e, e) C G x G which satisfies the uniform group axioms locally. 

Remark that a local group acts locally at left (and also by conjugation) on itself. 
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Definition 4.5 A conical local uniform group N is a local group with a local action of 
(0, +oo) by morphisms 5 £ such that lim £ ^o b~e% = e for any x in a neighbourhood of 
the neutral element e. 

We shall make the following hypotheses on the local uniform group G: there is a 
local action of (0, +oo) (denoted by 5), on (G, e) such that 

HO. 5 £ (e) = e and lim e ^o 5 £ x = e uniformly with respect to x. 

HI. the limit 

P(x,y) = limj; 1 ((5 £ x)(5 £ y)) 
is well defined in a neighbourhood of e and the limit is uniform. 
H2. the limit 

x- 1 = W7 1 (Ok*)- 1 ) 
is well defined in a neighbourhood of e and the limit is uniform. 

Proposition 4.6 Under the hypotheses HO, HI, H2 (G,(3) is a conical local uniform 
group. 

Proof. All the uniformity assumptions permit to change at will the order of taking 
limits. We shall not insist on this further and we shall concentrate on the algebraic 
aspects. 

We have to prove the associativity, existence of neutral element, existence of inverse 
and the property of being conical. The proof is straightforward. For the associativity 
P(x,P(y,z)) = P(P(x, y), z) we compute: 

(3(x,0(y,z)) = lim C 1 { (5 £ x)5 £/v ((<S„i/)(<$„3))} 

We take e = n and we get 

= /3(x,/3(y,z)) = \im{(5 £ x)(5 £ y)(8 £ z)} 

£—►0 

In the same way: 

0(fi(x,y),z) = lim C 1 {(5 e/ x) {{5 v x){5 v y)) (5 £ z)} 

and again taking e = rj we obtain 

P(J3(x,y),z) = lim {(6 £ x)(5 £ y)(5 £ z)} 

e— >0 

The neutral element is e, from HO (first part): P(x,e) = P(e,x) = x. The inverse of 
x is by a similar argument: 

I3(x,x~ l ) = lim 5 £ l {{8 £ x){5 £h {5 v xy 1 )} 
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and taking e = rj we obtain 

^(x,^ 1 ) = lim^ 1 ({5 £ x)(5 £ x)- 1 ) = limJj^e) = e 

Finally, (3 has the property: 

(3(5 v x,S v y) = 5 v (3(x,y) 

which comes from the definition of (3 and commutativity of multiplication in (0, +oo). 
This proves that (G, (3) is conical. ■ 

We arrive at a natural realization of the tangent space to the neutral element. Let 
us denote by [/, g] = f o g o f^ 1 o g^ 1 the commutator of two transformations. For 
the group we shall denote by L x y = xy the left translation and by L x y = (3(x,y). 
The preceding proposition tells us that (G, (3) acts locally by left translations on G. 

Proposition 4.7 We have the equality 

Um o 5^[S £ ,Lf 5xxyi ]S x = L N X (4.2.5) 

and the limit is uniform with respect to x. 

Proof. We shall use the equality: 

lim «5 £ - 1 (/3(((5 £ x),«5 £ y))) = lini yS £ {(3( X -\5-\(3{x,y)))) (4.2.6) 

which is a consequence of HO and the previous proposition. The left hand side of this 
equality equals f3(x, y). We shall look now at the right hand side (RHS) of the equality. 
The uniformity assumptions and some computations lead us to the following equality, 
under the constraint < max {fj,, A, n} < e: 

RHS = lim 5 £/m {<5 m /a {fax)' 1 ) <*„/(„,) ((5 v x)(5 v y))} 

(e,|U,A,r))->-0 

The mentioned constraint forces all expressions to make sense. Take now \i = A = rj < e 
and get 

RHS = lim 6- 1 6 £ (((S x x)- 1 )S- 1 ((S x x)(S x y))) 

£ 5 A — 

as desired. ■ 

An easy corollary is that 

Definition 4.8 The group VT e G formed by all transformations L x is called the virtual 
tangent space at e to G. 
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The virtual tangent space VT X G at x € G to G is obtained by translating the group 
operation and the dilatations from e to x. This means: define a new operation on G 
by 

y ■ z = yx z 

The group G with this operation is isomorphic to G with old operation and the left 
translation L x y = xy is the isomorphism. The neutral element is x. Introduce also 
the dilatations based at x by 

6*y = xS e {x~ x y) 

Then G x = [G, •) with the group of dilatations 5 X satisfy the axioms Ho, HI, H2. 
Define then the virtual tangent space VT X G to be: VT X G = VT X G X . A short 



computation using proposition f4.7| shows that 



VT X G = [L%> x = L x L^. ly L x :yeU x £ V(X)} 

where 

L y' x = x ^o r ^ [s ^ L ?^-^ 

We shall introduce the notion of commutative smoothness, which contains a deriva- 
tive resembling with Pansu derivative. Independently, the author introduced a general 
"topological" derivative in Q (there are a lot of typographical errors in this reference, 
but the ideas behind are quite nice in my opinion). Lack of knowledge stopped the 
development of this notion until recently, when I have learned about Carnot groups 
and Pansu derivative. As it will be seen further, interesting information will be found 
when noncommutative smoothness is considered. 

Definition 4.9 A function f : G\ — > G2 is commutative smooth, where G\, G2 are two 
groups satisfying HO, HI, H2, if the application 

( x?u )eG? ) ^ (f(x),Df(x)u)£G { 2 2) 

exists and it is continuous, where 

Df(x)u = limC 1 ^)- 1 /^)) 

and the convergence is uniform with respect to (x,u). 

For example the left translations L x are commutative smooth and the derivative 
equals identity. If we want to see how the derivative moves the virtual tangent spaces 
we have to give a definition. 

Definition 4.10 Le f : G — > G be a commutative smooth function. The virtual tangent 
to f is defined by: 

VTf(x):VT x G^VT f(x) G , VTf(x)L^ = L f{x) L% f(x)y Ljl x) 
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With this definition L x is commutative smooth and it's virtual tangent in any 
point y is a group morphism from VT y G to VT xy G. More generally, reasoning as in 
proposition 2.14, we get: 

Proposition 4.11 The virtual tangent is morphism of conical groups. 

Nevertheless the right translations are not commutative smooth. This failure pushes 
us to consider a notion of noncommutative derivative. We shall meet the same failure 
soon, when looking to the manifold structure, in the sense of definition [2.20 , of the 
group G. 

Now that we have a model for the tangent space to e at G, we can show that the 
operation is commutative smooth. 

Proposition 4.12 Let G satisfy HO, HI, H2 and sf ] : G^ -> G^ be defined by 

SP(x,u) = (S £ x,5 £ y) 
Then satisfies HO, HI, H2, op is commutative smooth and we have the relation: 

D op (x,u)(y,v) = P(y,v) 



Proof. It is sufficient to use the morphism property of the operation. Indeed, the 
right hand side of the relation to be proven is 

RHS = lim d^ 1 (op{x, u)~ 1 op(x, u)op (y, = 
= lim^ 1 (qp$ a > = (3(y,v) 

The rest is trivial. ■ 

4.3 Manifold structure 

The notion of virtual tangent space is not based on the use of distances, but on the use 
of dilatations. In fact, any manifold has a tangent space to any of its points, not only 
the Riemannian manifolds. We shall prove in this section that VT e G is isomorphic to 
the nilpotentisation N(G,D). 

Nevertheless G does not have the structure of a N(G, D) C 1 manifold, in the sense 



of definition 2.20 



We start from Euclidean norm on D and we choose an orthonormal basis of D. We 
can then extend the Euclidean norm to q by stating that the basis of q constructed, as 
explained, from the basis on D, is orthonormal. By left translating the Euclidean norm 
on g we endow G with a structure of Riemannian manifold. The induced Riemannian 
distance dn will give an uniform structure on G. This distance is left invariant: 



d R (xy,xz) = d R (y,z) 
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for any x,y,z € G. 

Any left invariant distance d is uniquely determined if we set d(x) = d(e,x). 
The following lemma is important (compare with lemma [2^ ) . 

Lemma 4.13 Let X\, X p be a basis of D. Then there are U C G and V C N(G, D), 
open neighbourhoods of the neutral elements ec cn respectively, and a surjective func- 
tion g : {1, M} — > {1, ...,p} such that any x € U , y S V can be written as 

M M 

x = Y[exp G (tiX g(i) ) , y = Yl^PN( T i X g(i)) (4.3.7) 

i=l i=l 



Proof. We shall make the proof for G; the proof for N(G, D) will follow from the 
identifications explained before. 

Denote by n the dimension of q. We start the proof with the remark that the 
function 



(ti,...,t n ) h-> JJexp G (£jXj) (4.3. 



is invertible in a neighbourhood of E R n , where the Xi are elements of a basis B 
constructed as before. Remember that each € B is a multi-bracket of elements 
from the basis of D. If we replace a bracket exp G (t[x, y}) in the expression ( 4.3.8[ ) by 



expc(tix) expc(t2y) expcit^x) expcit^y) and we replace t by (ii, £4) then the image 
of a neighbourhood of by the obtained function still covers a neighbourhood of the 
neutral element. We repeat this procedure a finite number of times and the thesis is 
proven. ■ 

As a corollary we obtain the Chow theorem for our particular example. 
Theorem 4.14 Any two points x,y € G can be joined by a horizontal curve. 

Let do be the Carnot-Caratheodory distance induced by the distribution D and the 
metric. This distance is also left invariant. We obviously have da < do- We want to 
show that dfi and dc induce the same uniformity on G. 

Let us introduce another left invariant distance on G 

d l G {x) = inf I I *i I : x = II exp G(^) ' Y i e D } 
and the auxiliary functions : 

A G {x) = inf jj^ I ti| : x = JJ ex.p G (UX g{i) ) j 

A G (x) = inf j max I j : x = exp G (^X gW )j 

From theorems |1.5| and |1.3j we see that d G = dc- Indeed, it is straightforward that 
do ^ d G . On the other part d G (x) is less equal than the variation of any Lipschitz 
curve joining cq with x. Therefore we have equality. 
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The functions A G , A g don't induce left invariant distances. Nevertheless they are 
useful, because of their equivalence: 

A%(x) < A G (x) < M A^(x) (4.3.9) 

for any x E G. This is a consequence of the lemma 4.13| . 
We have therefore the chain of inequalities: 

d R < d G < A G < MAq 



But from the proof of lemma 4.13 we see that A G is uniformly continuous. This proves 
the equivalence of the uniformities. 

Because exp G does not deform much the dn distances near e, we see that the group 
G with the dilatations 

5 e (exp G x) = exp G (<5 e x) 

satisfies HO, HI, H2. 

The same conclusion is true for the local uniform group (with the uniformity induced 
by the Euclidean distance) q with the operation: 

XY = log G (exp G (X)exp G (y)) 

for any X, Y in a neighbourhood of G (5. Here the dilatations are 5 £ . We shall denote 
this group by log GThese two groups are isomorphic as local uniform groups by the map 
exp G . Dilatations commute with the isomorphism. They have therefore isomorphic (by 
exp G ) virtual tangent spaces. 

Theorem 4.15 The virtual tangent space VT e G is isomorphic to N(G,D). More pre- 
cisely N(G,D) is equal (as local group) to the virtual tangent space to logG: 

N(G, D) = VT log G 



Proof. The product XY in log G is given by Baker-Campbell-Hausdorff formula 

X s Y = X + Y + ^[X,Y} + ... 



Use proposition L2 to compute (3(X,Y) and show that j3(X,Y) equals the nilpotent 
multiplication. ■ 

Let {Xi, ...,Xfji m } be a basis of g constructed from a basis {X%, ...,X P }, as ex- 
plained before. Denote by 

b ■ fl n(g,D) 

the identification of the vectorspace q with n(g,D) using the basis {X\, Xdi m }. 
The exponential and the logarithm of the groups G and N(G, D) will be denoted by 
exp G , log G and exp^, log^y respectively. 
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We shall consider the following N(G, D) atlas for G: 

A = {ijj x :U X C G —> N(D, G) : x £ G , 

ipx(y) = expjvo&ologG^x -1 ?/)} 

We shall study the differentiability properties of the transition functions. We want to 
work in g. For this we have to transport (in a neighbourhood € 5) the interesting 
operations, namely: 

X S Y = log G (exp G (X) exp G (y)) 
X"Y = log N (exp N (X)exp N (Y)) 
and to use instead of the chart ip x : XJ X C G —* N(D,G), the chart 4>x '■ U x C G — > g, 

where x = exp G X and <px(y) = (— X) ■ log G (y). 

The transition function from (j>x to 4>q is then the left translation by X, with respect 
to the operation We denote this translation by L s x . We want to know if this function 
is Pansu derivable from (g, ") to itself. See the difference: the function is certainly 



derivable (or commutative smooth) in the sense of definition 4.S , for the operation •, 
but what about the derivability with respect to the operation •? In order to answer we 
shall use the following trick: a C°° function / : N — > N is Pansu derivable if and only if 
it preserves the horizontal distribution and the derivative in each point is a morphism 
from N to N. 

The (classical) derivative of L x moves the distribution D n (Y) = DLy(e) into 
DL x D n (Y) C T^g^g. The horizontal distribution in X ■ Y, corresponding to the 

group operation is D n (X ■ Y). The difference between these two distributions is 
measured by one of the linear transformations: 

Axy :T By g^r v g 

A x ,y = {DI^p))- 1 D2^(0) (^^(O))" 1 (4.3.10) 

A x ? :g-g 

A X,Y = (DL« gy (0)) _1 J DL^ gy (0)( J DL?.(0))- 1 J DL?.(0) (4.3.11) 

Let then 3(G, D) be the Lie group generated by these transformations 

2(G,D) = ({A X > Y : X,Y Gg}) 

It is then straightforward to see that the algebra j(G, D) of this group contains the 
algebra generated by all the linear transformation with the form 



a x = ad^ - ad 1 ^ (4.3.12) 

The necessary and sufficient condition for the group J~(G, D) to be included in the 
group End(($, -),D) is that all the elements ax, to be in the algebra of the mentioned 
linear group. But this is equivalent with one of the (equivalent) conditions: 
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(i) Ad G 1S h ']n morphism, for any x in a neighbourhood of the identity, 

(ii) for any X, U, V € g we have the identity: 

[[X,U] G ,V] N + [U,[X,V] G ] N = [X,[U,V] N ] G (4.3.13) 

We collect what we have found in the following theorem. 

Theorem 4.16 Set Ad G to be the adjoint representation of G and Ad N the adjoint 
representation ofN(G,D), seen as group of linear transformations on g (via the iden- 
tification function b). The following are equivalent: 

(a) J(G,D) C End(g,"), 

(b) Ad G C End(Q,"), 

(c) the relation ( [4.3. 13| ) is true. 



If ( 4,3.13j ) holds then Ad G Ad N is a group and its Lie algebra is the adjoint represen- 



tation of the algebra q © g with the bracket: 

[(X,U),(Y,V)} = {[X,Y] g ,[X,V]g + [U,Y] g + [V,U] n ) (4.3.14) 



Proof. The equivalence of (a), (b), (c) has been explained. Suppose now that ( 4.3.13 ) 
is true. It is then straightforward to show that the space of all elements 

d(X,Y) = &dx — ady 
forms a Lie algebra with the linear commutator as bracket. Moreover, we have: 

i a (X,U)^ a (Y,V)} = a [(X,U),(Y,V)] 

This shows that AdF Ad N is a group and the property of its algebra. In order to finish 
the proof remark that ch(x,X) = a x, defined at ( 4.3.12 ). ■ 

Corollary 4.17 The atlas A gives to G a C 1 N(D,G) manifold structure if and only 
if ( |4.3.13|) is true and for any e > 0, X, Y G g 

6- x [X,8 e Y] N - 8- x [X,8 s Y] G = [X,Y] N - [X,Y] G (4.3.15) 
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Proof. The atlas A gives to G a C 1 N(D,G) manifold structure if and only if 
J{G,D) C HL(q, •) and any of its elements commute with dilatations 5 e . This 
is equivalent with (|4.3T3|) and ( p~3T^ ). ■ 

Remark 4.18 A Lie group is a manifold endowed with a smooth operation. In what 
sense is then G a (sub-Riemannian) Lie group? We already have problems to assign an 
atlas with smooth transition functions to G. The real meaning of the corollary is 
that even if we succeed to give to G an atlas with smooth transition functions then the 
left translations and the exponential map will not be smooth. This is a problem. If we 
renounce to give G a manifold structure, at least the operation is smooth in the sense 



of proposition 4-1 



Returning to the problem of the atlas, if N(G, D) is a Heisenberg group, or equiv- 
alently the distribution has codimension one, then the commentary of Bellaiche ||/, 
page 73, shows that such an atlas exists, as a consequence of Darboux theorem on nor- 
mal forms for contact differential forms. But such an atlas is not compatible with the 
operation. 

We shall see now how it can be possible that the operation is commutative smooth 
but the right translations are not. 

We shall look to the double G^. This group is isomorphic with G 2 = G x G with 
componentwise multiplication by the isomorphism 

F : G® _» G 2 , F{x,u) = (x,xu) 

The Lie algebra of G^ is easily then = q x q, with Lie bracket 

[(x,u),(y,v))(2) = ([x,u],{x + u,y + v] - [x,y]) 

Take in the distribution = D x D. This distribution generates q^ 2 \ and the 
dilatations are 

5^ 2 \x,u) = (5 £ x,5 £ u) 

We translate the distribution at left all over 

We know that the operation is commutative smooth. We shall check if the classi- 
cal derivative of the operation transports the distribution in the distribution D. 
Straightforward computation shows that 

D op(e,e)D ( - 2) = D 

where D here means classical derivative. Indeed, D op(e, e)(X, Y) = X + Y and D is 
a vector space, therefore (X, Y) G implies X + Y € D. 

For general (x, u) € G^ this is no longer true. Indeed, we have: 

D op(x,u)(Y,V) = DL^ u (0){Y + V + (Ad^ u -Ad^)Y} 

therefore a sufficient condition for D op(x, u)D^ = D is that x = e. 

In conclusion the fact that the operation is commutative smooth does not imply 
that so are the right translations. 
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We can hope that by a slight modification in the definition of smoothness we shall 
be able to give to G a manifold structure and simultaneously to have smooth right 
translations. For the manifold structure we have to "smooth" the group generated by 
the elements with the form 

{A<fi x T l Ad N x 

To solve the problem of the right translations we need that the group Ad G be "smoothed" 
Before doing this we shall see that the pure metric point of view does not feel these 
problems but in such a precise way. 

4.4 Metric tangent cone 

We shall prove the result of Mitchell p3] theorem 1, that G admits in any point a 
metric tangent cone, which is isomorphic with the nilpotentisation N(G,D). Mitchell 
theorem is true for regular sub-Riemannian manifolds. The proof that we give here is 



based on the lemma 4.13 and Gromov |13[ section 1.2. 

Because left translations are isometries, it is sufficient to prove that G admits a 
metric tangent cone in identity and that the tangent cone is isometric with N(G,D). 

For this we transport all in the algebra q, endowed with two brackets [•, -] G , [■, -]jv 
and with two operations • and ". Denote by <i G , d^ the Carnot-Caratheodory distances 
corresponding to the •, respectively " left invariant distributions on (a neighbourhood of 
in) q. / G , In are the corresponding length functionals. We shall denote by Bg(x, R), 
Bn(x,R) the balls centered in x with radius R with respect to da, djy- 

We can refine lemma 4.12 in order to obtain the Ball-Box theorem in this more 
general situation. 

Theorem 4.19 (Ball-Box Theorem) Denote by 

Box G (e) = {x G G : A G (x) < e} 

Box^ie) = {x € N : A x N {x) < e} 
For small e > there is a constant C > 1 such that 

exp G (Box l N (e)) C Boxq(Ce) C exp G (Box l N {C 2 e)) 



Proof. Reconsider the proof of lemma 4.13. This time, instead of the trick of replacing 
commutators exp G [X, Y] (t) with four letters words 

exp G X(ti) exp G Y(t 2 ) exp G X(t 3 ) exp G Y(U) 

we shall use a smarter replacement (which, important! , works equally for the nilpo- 
tentisation N(G,D)). 
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We start with a basis {Yy, Y n } of the algebra g, constructed from multibrackets of 
elements {X±, X p } which form a basis for the distribution D. Introduce an Euclidean 
distance by declaring the basis of orthonormal. Set 

[X(t),Y(t)]° = (tX) S (tY) S (-tX) B (-tY) 

It is known that for any set of vectors Z\, Z q £ g, if we denote by a°{Z\(t), Z q {t)) 
a [•, -]° multibracket and by a(Zi, Z q ) the same constructed [■, •] multibracket, then 
we have 

a°{Z x (t),...,Z q (t)) = t^a(Z 1 ,...,Z q ) + o(t") 

with respect to the Euclidean norm. 

Remark as previously that the function 

n 

(h,...,t n ) ^ HiUYi) (4.4.16) 

i=l 

is invertible in a neighbourhood of G R n . Each Xi from the basis of g can be written 
as a multibracket 

Xi = a^X^, ...,Xji) 
which has the length Zj = j l — 1. If /, is odd then replace itiXi) by 

a°(X il (t 1 /'*),...,A ii (t 1 /'*)) 

If li is even then the multibracket a« can be rewritten as 

oci(Xi 1 ,...,Xji) = /3i(Xi 1 ,...,[Xi k ,Xi k+l ],...,Xji) 

Replace then {UX-i) by 

f (3°(X h (\ t \W),... t [Xi h (\ t \ l Hx ik+1 (\ t t I 1 / 1 ')) if t > 

\ 0°(X h (\ t |V^),..., [ Xtk+1 (\ t \W),X ik (\ t |V^)]o,..., Xj ,(| t |V^)) if t < o 

After this replacements in the expression ( |4.4.16| ) one obtains a function Eg which 
is still invertible in a neighbourhood of 0. We obtain a function En with the same 
algebraic expression as Eg, but with [•, -]° N brackets instead of [-,-] G ones. Use these 
functions to (obviously) end the proof of the theorem. ■ 

Theorem 4.20 The Gromov-Hausdorff limit of pointed metric spaces (g, 0, Xdc) as 
A — > oo exists and equals (g,djv). 



Proof. We shall use the proposition 1.24. For this we shall construct e isometries 



between Box]^(l) and Box G (Ce). These are provided by the function Eg ° E^ 1 o 5 £ . 

The trick consists in the definition of the nets. We shall exemplify the construction 
for the case of a 3 dimensional algebra g. The basis of g is X±, X2, X3 = [X±, X2]g- 
Divide the interval [0, X\] into P equal parts, same for the interval [0, X2]. The interval 
[0, X3] though will be divided into P 2 intervals. The net so obtained, seen in N = g, 
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turn Eq o E^ o S £ into a e isometry between Box]^(l) and Box G (Ce). To check this 
is mostly a matter of smart (but still heavy) notations. ■ 

The proof of this theorem is basically a refinement of Proposition 3.15, Gromov [O], 
pages 85-86, mentioned in the introduction of these notes. Close examination shows 
that the theorem is a consequence of the following facts: 

(a) the identity map id : (g,da) — ► (fl^iv) has finite dilatation in (equivalently 
exp G : N(D, G) — > G has finite dilatation in 0), 

(b) the change from the G-invariant distribution induced by D to the ./V invariant 
distribution induced by the same D is classically smooth. 

It is therefore natural that the result does not feel the non-derivability of id map in 
x^O. 



4.5 Noncommutative smoothness in Carnot groups 

Let N be a Carnot group and 

N = Vi + ... + V m (4.5.17) 
the graduation of its algebra. Let End(N) be the group of endomorphisms of N. 



Definition 4.21 The group of vertical endomorphisms of N , noted by VL(N), is the 



subgroup of End(N) of all N endomorphisms A which admit the form A 
respect to the direct sum decomposition of N (4.5.17), such that 



(Aij) with 



id Vi , A^ =0 Vi < j G {1, ...,m} 



Proposition 4.22 (a) The elements of the linear group HL(N) have diagonal form 
with respect to the direct sum decomposition (4.5.17). 

(b) Any A E End(N) admits the unique decomposition 

A = A v A h , A v e VL(N) , A h e HL(N) 



(c) Ad(N) is a normal subgroup ofVL(N). The quotient group is Carnot. 



Proof, (b) Proof by induction. Just write what the algebra morphism condition for 
A € End(N) means and get the inferior diagonal form of A. Such a matrix (with ele- 
ments linear transformations) decomposes uniquely in a product of a vertical morphism 
A v and a diagonal morphism A h . Check that the latter commutes with dilatations. 

(a) We know from (b) that any element A £ HL(N) has inferior diagonal form. 
Commutation with dilatations forces A to have diagonal form. 
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(c) Ad(N) is normal in End(N) and it is a subgroup of VL(N). It is therefore 
normal in VL(N). The quotient group is nilpotent, because VL(N) is nilpotent. It is 
Carnot because it has dilatations. Consider the map 

5 £ : End(N) -> End(N) , 5 s A = 5 £ A5~ l 

From (b) we get that 5 £ is an endomorphism of VL{N). From the fact that 5 £ € End(N) 
we get that 

5 £ Ad(N) = Ad(N) 

Therefore 5 s factorizes to an endomorphism of the quotient VL(N)/Ad{N). The rest 
is trivial. ■ 

We shall call the quotient 

R(N) = VL(N)/Ad(N) 

the rest of N. 

The definition of noncommutative derivative follows. For any function / : N — > N 
and x e N set 

f x :N^N , f x (y) = /(x)- 1 /^) 

/ is Pansu derivable in x if and only if f x is Pansu derivable in 0; moreover the Pansu 
derivative of / in x equals the Pansu derivative of f x in 0. 

From the proof of Pansu's Rademacher theorem one can extract the following in- 
formation: 

Proposition 4.23 if f is Pansu derivable and the convergence 

hmS' 1 f x (xS e y) 

is uniform with respect to x then f is classically derivable in x along the horizontal 
directions and we have the connection between Pansu derivative and classical derivative: 

Pf(x) = Vf(x) = (DLf^y 1 (0)Df(x)DL x (0) 

Conversely if f is classically derivable along the distribution, such that the quantity 

V/(x) !o = (DLfw)- 1 (0)Df(x)DL x (0)\ D 

can be extended to an algebra morphism of N, an such that the convergence 

lim - (f(x5 £ y) - f(x)) 
e^O e 

(with y G D) is uniform with respect to x then f is Pansu derivable. 

Definition 4.24 A function f : N — > N is noncommutative smooth if for any B € 
VL(N) there is a continuous function Ab ■ N — > VL(N) such that: 
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(a) for any x E N the map 

y -> /f (y) = A{x)- l UBy) 

is Pansu derivable in 0, 

(b) the map 

( x , y ) e N 2 ^ (f(x),Df*(0)y) e N 2 

is continuous, 

(c) the convergence 

Df x (0)y = limj- 1 f x (x5 e y) 
is uniform with respect to x. 
The derivative of f in x is by definition 

Df(x) = Aj(x)Df^x) 



Note that if / is invertible then we have the decomposition: 
Df{xf = Aj(x) , Df(xf = Dfiix) 
Moreover, if / is invertible and classically derivable then the conclusion of proposition 



4.23| is still true, namely Df(x) = V/(x), where the "gradient" V/(x) is given by 

[DL f{x) y l (0)Df(x)DL x (0) 

The derivative is called noncommutative because it does not commute with dilata- 
tions. 

Proposition 4.25 If f : N — > N is commutative smooth then it is also noncommuta- 
tive smooth. 

If f,g : N — > N are noncommutative smooth then so it is f o g. 



Proof. Because the noncommutative derivative splits the function in a horizontal 
and a vertical part, it is sufficient to prove that if / is commutative smooth and g is 
noncommutative smooth such that D h g(x) = id then fog is noncommutative smooth. 
This is contained in the definiton of noncommutative smoothness. ■ 

The chain formula is true for noncommutative derivative. But obviously not for the 
vertical and the horizontal parts. 

With this definition of smoothness the right translations in a Carnot group become 
smooth. It is easy to show that 

D R x = Ad^ 1 , D V R X = Ad' 1 , D h R x = id 

This smoothness definition seems to be a good one for Carnot groups. It is not 
good enough though, as the following example shows. 
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4.6 Moment maps and the Heisenberg group 

Let G = T k @N be the direct sum of the k dimensional torus T k with the Carnot 
group N. The Lie algebra of G is g = R k © N (recall that we use the same notation 
for the Carnot group N and its Lie algebra). 

The algebra q is Carnot, with respect to the dilatations 

S e (T,X) = (eT,5 £ X) 

The distribution is generated by R k © V 1 and the nilpotentisation of G with respect to 
this distribution is N' = R k © H. 

The corollary 4.17 tells that G admits a N' manifold structure. As the construction 
of the noncommutative derivative is local in nature, we could reproduce it for N' 
manifolds. In this case that simply means: in reasonings of local nature one can work 
on N' instead of G. 

Take now N = H{n) and a toric Hamiltonian action 

(*!, s k ) £T k h+ (sii ..., Sfc) € Sympl(R 2n ) 
For any v G R k = Lie T k the infinitesimal generator is 

&>( X ) = 37 <t>exptv(x) 

at \ t =o 

Let H v be the Hamiltonian of t i — >■ (f> cxp tv The moment map associated to the Hamil- 
tonian toric action is then 

3^.R 2n -> [R k )* , (j(x),v) = H v {x) 

We shall associate to the toric action a function on T K x H(n). 

= T k x H(n) — > G , *(fl,x,z) = (s,^(x,x)) (4.6.18) 
where <j) s is the lift of (4> s ,0) (see theorem |3.3[ ) and it has the expression: 

(<j) s (x),x + F s (x)) 
Let us compute the "gradient" 

V$(s,xx) = (DL^x^iO))' 1 D<f>(s,x,x) DL {s>x>s) (0) 
In matrix form with respect to the decomposition of the algebra 

R k @H(n) = R k © R 2n © R 
we obtain the following expression: 

1 
V <I>K.r..r) - ( ^ D0 S 

a(s,x) 1 
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where a(s, x) is the function: 

< \ dF 1 U d( i ) \ 

We can improve the expression of the function a if we use the fact that the toric action 
is Hamiltonian. Indeed, by direct computation one can show that 

da 3 

This is equivalent with 

a(s,x) = \(s) - j^(<p a (x)) 
Easy computation shows that End(R k © H(n)) is made by all matrices with the 



form: 














( ; 


B 





\ a 


b 


c 



/? ^ , G HL(H{n)) , a <G (-R ) = ( Lie T 

It follows that End(R k © H(n)) and we already saw this in a particular form 
in proposition |3.6| . Therefore $ is not even noncommutative smooth in the sense of 
definition [4.24 

Suppose that the toric action is by diffeomorphisms with compact support. Then 
X(s) can be taken equal to hence a(x, s) = —j^{x). 

Proposition 4.26 The map <3? transforms the distribution 

D N (s,x,x) = l(S,Y,Y) : - \u{x,Y) + Y 
in the distribution 

D*(s,x,x) = |(5,y,Y) : (U(x),S) - \u{x,Y) + Y = o| 



Proof. By direct computation. ■ 

4.7 General noncommutative smoothness 

The introduction of noncommutative derivative makes right translations in Carnot 
groups "smooth". We shall apply the same strategy for having a manifold structure 
compatible with the group operation, in the case of a Lie group G with a left invariant 
distribution D. 

The transition functions of the chosen atlas have the form (L^) , with X € Q. 
These transition functions forms a group. Note that a natural generalization is the case 
where the transition functions forms a groupoid. This generalisation is not meaningless 
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since it leads to the introduction of intrinsic SR manifolds, if we identify a manifold 
with the groupoid of transition functions of an atlas. 

We want this family of transition functions to be "smooth". We shall proceed then 
as in the motivating example. 

The group G is identified with the group of left translations L G acting on the Carnot 
group (fl,")- These transformation are not smooth, they preserve another distribution 
than the N left invariant distribution on G. 



We shall give the following "upgraded" variant of the definition 4.24 , concerning 
the noncommutative derivative. 



n 



We shall denote by N the set g endowed with the nilpotent multiplication •. The 
same notation will be used for the Lie algebra (g, [■, -]n)- We fix on N an Euclidean 
norm, given, for example, by declaring orthonormal a basis constructed from multi- 
brackets from a basis of D. The Euclidean norm will be denoted by || • ||. 

We introduce the group of linear transformations D) generated by all trans- 

formations DL 9 x (0), DR 9 x (0), HL(N). In particular 5 £ G T,(G, D) for any e > 0. From 
the formula: 

DL N X {Q) = limS-'oDLl^oSe 

we get that DL^(0) G Z(G,D). In the same way we obtain DR^(0) G £(G,£>). 

For any F G D) we define the group N(F) to be the Carnot group N with 
the bracket: 

[X,Y]% = F[F^X, F~ x Y]jy 

and with the dilatations 

6? = F5 e F l 

This group is endowed with the Carnot-Caratheodory distance generated by the Eu- 
clidean norm and the distribution D N (F) = FD. 

We can see the class {N(F) : F G ~S(G,D)} in two ways, as an orbit. First, it is 
the orbit 

E(G,D) = {(F5 £ F-\F ad^F- 1 ) :FeE(G,D)} 
in the space GL(N) x gl(N,gl(N)), under the action of T,(G,D) 

F.(A,B) = (FAF^ 1 , F(B)) , F(B)(Z) = FB{F- l Z)F~ l 

Second, it is the right coset S(G, D)/HL(N). Indeed, if N(F) = N(P) then F~ l P G 
HL(N). 

Definition 4.27 A function f : N — > N is (G, D) noncommutative derivable inx G N 
if for any F G D) there exists P G S(G, D) such that f : N{F) -» N{P) is Pansu 
derivable in x. The derivative of f in x is by definition: 

Df(x)(F,P) = p- 1 oD Pansu f{x)oF 



The function f is noncommutative smooth if 
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(a) the map 

(x,y)£N 2 ^(f(x),Df(x)y)eN 2 

is continuous, 
(c) the convergence 

D Pansu f(x)(y = lim S' 1 ^ (PiP- 1 (fix))' 1 p- l f{x5^y))) 

is uniform with respect to x. 

If / is invertible then following map is well defined: 

A(f,x):£(G,D)^-£(G,D) 

A(f,x)(F HL(N)) = P HL(N), where / : N(F) -» N(P) is Pansu derivable in x. In 
this case the noncommutative derivative can be written as Df(x,N(F)). 
The definition can be adapted for functions / : N — > R, or / : R — > N. 

Definition 4.28 A function f : N — > R is noncommutative derivable in x if for any 
F £ D) the function f : N(F) — > R is Pansu derivable in x. In this case 

Df(x,N(F)) = Dp ansu f(x) o F 

A function f : R — > N is noncommutative derivable inx £ R if there exists P £ S(G, D) 
such that f : R — > N(P) is Pansu derivable in x £ i?. In i/tis case 

D/(a:) = r^Dta/W 

T/ie definition of noncommutative smoothness adapts obviously. 

We might risk to have no noncommutative derivable functions from N to R. On 
the contrary we shall have a lot of noncommutative derivable functions from R to N. 
What is the connection with the previous notion of noncommutative smoothness? 



The noncommutative smoothness in the sense of definition 4.24 corresponds to the 



definition 4.27 if we replace T,(G,D) with End(N). 



The following proposition has now a straightforward proof. 
Proposition 4.29 The transition functions of the N(G, D) atlas A are D) smooth. 



Proof. Indeed, it is sufficient to remark that J(G,D) C Y,(G,D). ■ 
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4.8 Margulis &c Mostow tangent bundle 



In this section we shall apply Margulis & Mostow [21] construction of the tangent 
bundle to a SR manifold for the case of a group with left invariant distribution. It will 
turn that the tangent bundle does not have a group structure, due to the fact that, as 
previously, the non-smoothness of the right translations is not studied. 

The main point in the construction of a tangent bundle is to have a functorial 



definition of the tangent space. This is achieved by Margulis & Mostow [21] in a very 
natural way. One of the geometrical definitions of a tangent vector v at a point x, to 
a manifold M , is the following one: identify v with the class of smooth curves which 
pass through x and have tangent v. If the manifold M is endowed with a distance then 
one can define the equivalence relation based in x by: c\ = x ci if ci(0) = C2(0) = x and 
the distance between c\(t) and o^ify is of order t 2 for small t. The set of equivalence 
classes is the tangent space at x. One has to put then some structure on the tangent 
space (as, for example, the nilpotent multiplication). 

To put is practice this idea is not so easy though. This is achieved by the following 
sequence of definitions and theorems. For commodity we shall explain this construction 
in the case M = G connected Lie group, endowed with a left invariant distribution D. 
The general case is the one of a regular sub-Riemannian manifold. We shall denote by 
d G the CC distance on G and we identify G with g, as previously. The CC distance 
induced by the distribution D N , generated by left translations of G using nilpotent 
multiplication -, will be denoted by djy. 

Definition 4.30 A C°° curve in G with x = c(0) is called rectifiable at t = if 
d G (x,c(t)) < Ct as t -> 0. 

Two C°° curves d , c" with c'(0) = x = c" (0) are called equivalent at x ij "i -1 d G (d '(t) , c" 
as t -> 0. 

The tangent cone to G as x, denoted by C X G is the set of equivalence classes of all 
C°° paths c with c(0) = x, rectifiable at t = 0. 

Let c : [—1,1] ->Gbea C°° rectifiable curve, x = c(0) and 

v = lim 8^ (ciO)- 1 ° c(t)) (4.8.19) 

The limit v exists because the curve is rectifiable. 
Introduce the curve Co (t) = xexp G (Stv). Then 

d(x,co(t)) = d(e, x~ 1 c(t)) <| v \ t 

as t — ► (by the Ball-Box theorem) The curve c is equivalent with cq. Indeed, we have 
(for t > 0): 

- t d G (c(t),c (t)) = - t d G (c(t),x S 5 t v) = itfcO^-yx-^c^.O) 
The latter expression is equivalent (by the Ball-Box Theorem) with 

jdNiStiv- 1 ) ^x- 1 ? c(t),0) = d N (6r 1 (6t(v- 1 )?6t(sr 1 (x- 1 ?c(tj)y)) 
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The right hand side (RHS) converges to d^{v~ 1 ■ v,0), as t — > 0, as a consequence of 
the definition of v and theorem 4,15 . 

Therefore we can identify C X G with the set of curves t i— » x exp G (<5fu), for all d 6 g. 
Remark that the equivalence relation between curves ci, C2, such that ci(0) = C2(0) = 
a; can be redefined as: 



lim 



tfr^oaCtr^dCt)) = (4.8.20) 



In order to define the multiplication Margulis & Mostow introduce the families of 
segments rectifiable at t. 

Definition 4.31 A family of segments rectifiable at t = is a C°° map 

T :U ^G 

where U is an open neighbourhood o/GxO in G x R satisfying 

(a) .F(-,0) = id 

(b) the curve t i— > T{x,t) is rectifiable at t = uniformly for all x £ G, that is for 
every compact K in G there is a constant Ck and a compact neighbourhood I of 
such that dG(y^(y,'t)) < CkX for all (y,t) G K x I. 

Two families of segments rectifiable att = are called equivalent if t~ 1 dc(J'i(x,t), J-2(x,t)) 
as t — > 0, uniformly on compact sets in the domain of definition. 

Part (b) from the definition of a family of segments rectifiable can be restated as: 
there exists the limit 

-i 



v{x) = Jimdp [x~ l ■ F{x,t)j (4.8.21) 

and the limit is uniform with respect to x € K, K arbitrary compact set. 
It follows then, as previously, that T is equivalent to Tq, defined by: 

Toixjt) = x®5 t v(x) 
Also, the equivalence between families of segments rectifiable can be redefined as: 

lim^ -1 (T 2 {x,t)- X • Fi(x,if) = (4.8.22) 
uniformly with respect to x € K, K arbitrary compact set. 

Definition 4.32 The product of two families T\, J-% of segments rectifiable at t = is 
defined by 

(T x oT 2 ){x,t) = F x {T 2 {x,t),t) 

The product is well defined by Lemma 1.2 op. cit.. One of the main results is then 
the following theorem (5.5). 
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Theorem 4.33 Let c\, c 2 be C°° paths rectifiable at t = 0, such that ci(0) = xq = 
C2(0). Let T\, J- 2 be two families of segments rectifiable at t = with: 

Fi(xo,t) = ci(t) , T 2 (x ,t) = c 2 (t) 

Then the equivalence class of 

t \-+ T\ oF 2 (xo,t) 

depends only on the equivalence classes of c\ and c 2 ■ This defines the product of the 
elements of the tangent cone C Xo G. 

This theorem is the straightforward consequence of the following facts (5.1(5) and 
5.2 in Margulis & Mostow [?]). 

Remark 4.34 Maybe I misunderstood the notations, but it seems to me that several 
times the authors claim that the exponential map which they construct is bi-Lipschitz 
(as in 5.1(4) and Corollary 4-5). This is false, as explained before. In Bellaiche ||/, 
Theorem 7.32 and also at the beginning of section 7.6 we find that the exponential map 
is only 1/m Holder continuous (where m is the step of the nilpotentization) . However, 
(most of) the results of Margulis & Mostow hold true. It would be interesting to have 
a better written paper on the subject, even if the subject might seem trivial (which is 
not). 

We shall denote by J- ~ J-' the equivalence relation of families of segments rec- 
tifiable; the equivalence relation of rectifiable curves based at x will be denoted by 
c w c . 

Lemma 4.35 (a) Let T\ ~ T 2 and Q\ ~ Q 2 . Then J-\ o Q\ ~ T 2 o Q 2 . 

(b) The map T mapstoJ-Q is constant on equivalence classes of families of segments 
rectifiable. 

Proof. Let 

Fo{x,t) = x ■ 5 t w\[x) , Go(x,t) = x ■ 5 t w 2 {x) 
For the point (a) it is sufficient to prove that 

T o g m J=~ o g 
This is true by the following chain of estimates. 

1 1 9 

jd G (Fog(x,t),J r og (x,t)) = jd G (5 t wi(go{x,t)y l ■ dtw^x)' 1 ■ x~ l ■ J r (g(x,t),t),o) 

The RHS of this equality behaves like 

d N (5^ (WSoOM))- 1 ^^(x) -1 ■ S t (6^ (x- 1 S(M))) 3 5 t fa 1 (^(x,*)- 1 S ^(x,t),t)))) ,0) 
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This quantity converges (uniformly with respect to x € K, K an arbitrary compact) to 
dN(wi(xy 1 ■ w 2 (x)~ 1 ■ w 2 (x) ■ wi(x), 0) = 

The point (b) is easier: let T ~ G and consider Tq, Go, as above. We want to prove 
that To = Go, which is equivalent to w\ = w 2 . 

Because « is an equivalence relation all we have to prove is that if JT ~ Go then 
W\ = w 2 . We have: 

^d G (T (x,t),Go(x,t)) = -d G {x ?5 t wi(x),x ? S t w 2 (x)) 

We use the • left invariance of d G and the Ball-Box theorem to deduce that the RHS 
behaves like 

d N {5^ (6 t w 2 (x)- 1 f Stwxix)- 1 ^) ,0) 

which converges to d]y(wi(x),w 2 (x)) as t goes to 0. The two families are equivalent, 
therefore the limit equals 0, which implies that w\{x) = w 2 (x) for all x. M 

We shall apply this theorem. Let Cj(t) = xo exp G 5tVi, for i = 1,2. It is easy to 
check that J-i(x,t) = x exp G (5tv j) are families of segments rectifiable at t = which 
satisfy the hypothesis of the theorem. But then 

{Ti o T 2 ) (x, t) = x exp G (S t vi) eyLp G (5 t v 2 ) 

which is equivalent with 

Texp G [S t (vi n v 2 ) 



Therefore the tangent bundle defined by this procedure is the same as the virtual 
tangent bundle defined previously. It's definition is possible because the group operation 
has horizontal derivative in (e, e). 

In terms of commutative derivative, theorem 10.5 Margulis & Mostow [^0| (and 
restricting to bi-Lipschitz maps) becomes the Rademacher theorem. Indeed, in the 
case of a Lie group G endowed with a left invariant distribution D, with an associated 
Carnot-Caratheodory distance, the definition 4.£ of commutative derivative can be 
adapted in the following way: 

Definition 4.36 Let G\, G 2 two groups endowed with left invariant distributions and 
d\, d 2 tow associated Carnot-Caratheodory distances. 

A function f : G\ — > G 2 is metrically commutative derivable in x G G\ if there is a 
e > such that the sequence 

A x f(x)u = Sfifix^fixSxu)) 

converges uniformly with respect to u E B(x,e). The derivative is 

Df(x)u = hmC 1 (/(x)- 1 /^)) 

£—►0 

and the virtual tangent is 

VTf{x) : VT xGl - VT f(x) G 2 , VTf(x)L^ = L^L^L^ 
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Theorem 4.37 Let (j) : G — > G be a bi-Lipschitz map with respect to the Carnot- 
Caratheodory distance on G induced by the left invariant distribution D. Then for 
almost all x £ G the map <\> is commutative derivable at x and the virtual tangent to <\> 
is an isomorphism of conical groups. 

We don't give the proof of this theorem. The reader might find interesting to adapt 
the proof of the mentioned Margulis &; Mostow theorem in our particular case. 
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